Exploring the frontiers of Computer Vision and Deep Learning
Vision-language models and cross-modal understanding systems
Image analysis, object detection, and visual understanding systems
Learning representations from unlabeled data using contrastive methods
Neural networks, representation learning, and model optimization techniques
My current research focuses on advancing the robustness and efficiency of deep learning models through novel self-supervised learning approaches and multi-modal understanding. I am particularly interested in developing methods that can learn from minimal supervision while maintaining strong performance across diverse domains.