About Me
I am a Machine Learning Scientist at Amazon Prime Video, where I develop Video-Language Foundational Models that bridge visual and linguistic understanding at scale.
Previously, I was a Postdoctoral Research Fellow at the Stanford AI Lab (Stanford University), working under the guidance of Dr. Stefano Ermon. I am deeply grateful to Dr. Ermon for his exceptional mentorship and support.
My research at Stanford spanned efficient convolutional networks optimized for run-time complexity, unsupervised and weakly supervised learning for improved sample efficiency, generative models, and machine learning for computational sustainability.
I earned my Ph.D. from the Chester F. Carlson Center for Imaging Science at Rochester Institute of Technology, advised by Dr. Matthew J. Hoffman.
🎓 Education
Ph.D.
Chester F. Carlson Center for Imaging Science, RIT
2011 – 2016
Aerial Vehicle Detection and Tracking using a Multi-modal Adaptive Sensor
M.S.
Electrical & Computer Engineering, University of Bridgeport
2009 – 2011
Non-speech Environmental Sound Classification with Pitch Range-based Features
B.S.
Electrical & Electronics Engineering, Eskişehir Osmangazi University
2004 – 2009
Autonomous Parallel Parking of Non-holonomic Vehicles
💼 Professional Experience
Apr 2022 – Present
Machine Learning Scientist
Amazon Prime Video
Video-Language Foundational Models
Nov 2020 – Apr 2022
Sr. Research Scientist
Samsung Research America
Vision Transformer Compression · Multimodal Understanding
Jul 2018 – Oct 2020
Postdoctoral Research Fellow
Stanford University — Stanford AI Lab
Self-Supervised Learning · Dynamic Models · Generative Models · Computational Sustainability
Jun 2017 – Jul 2018
Computer Vision Engineer
Planet Labs
Convolutional Object Detection in Low-Resolution Aerial Imagery
Aug 2016 – Jun 2017
Computer Vision Engineer
Autel Robotics
High-Speed Object Tracking on Low-End Embedded Systems
Nov 2015 – May 2016
Computer Vision Algorithm Engineer Intern
Huawei R&D
Unsupervised Semantic Role Assignment in Photo Albums
🔬 Research Interests
🎥 Video-Language Understanding
⚡ Efficient Deep Learning & Model Compression
🤖 Multimodal Machine Learning
🌱 Computational Sustainability
🎨 Generative Models