Research
🔬 Research
Generative AI, video-language understanding, efficient deep learning, and machine learning for sustainability.
🎥 Video-Language & Long-Form Video
Foundational models and systems for long-form video understanding, captioning, ranking, and mitigating hallucinations in video-language models.
- From Frames to Clips · ECCV 2026
- Learning to Rank Caption Chains · ECCV 2026
- CounterVid · EMNLP 2026
- Narrative Aligned Long Form Video QA · CVPR Workshop 2026
âš¡ Efficient & Multimodal Models
Dynamic inference, structured pruning, weight sharing, and lightweight detectors for grounding-based vision-and-language models.
- Dynamic Inference with Grounding Based V&L Models · CVPR 2023
- GOHSP: Graph & Optimization-based Structured Pruning · AAAI 2023
- Learning to Jointly Share and Prune Weights · ICLR 2023
- Lite-MDETR · CVPR 2022
- Efficient Conditional Pre-training for Transfer Learning · CVPR Workshop 2022
- Augment the Pairs · WACV 2024
- Multimodal Benchmark for Zero-Shot Learning · WACV 2024
- Efficient High Resolution Image Processing with Deep RL · AAAI 2021
🌱 Computational Sustainability
Remote sensing, geolocated data, self-supervised learning, and interpretable models for agriculture, poverty mapping, and environmental monitoring.
- Geography-Aware Self-Supervised Learning · ICCV 2021
- Learning How to Interpret Satellite Images using Wikipedia · IJCAI 2020
- Learning When and Where to Zoom · CVPR 2020 (Oral)
- Predicting Geo-attributes with Street-Level Images · AAAI 2021
- Predicting Economic Development using Geolocated Wikipedia · KDD 2019
- Farmland Parcel Delineation · CVPR Workshop 2020
- Cloud Removal from Satellite Images · WACV 2020
- Open datasets · MapillaryGCN, cloud removal, WAMI
🎨 Generative Models
Generative modeling for satellite imagery, data augmentation, and counterfactual video generation for robust VLM evaluation.
- CounterVid: Counterfactual Video Generation · EMNLP 2026
- Negative Data Augmentation · ICLR 2021
- Cloud Removal (Spatiotemporal GAN) · WACV 2020
- Cloud removal code · GitHub
See the full publication list or Google Scholar profile for a complete bibliography.
