Company logo hidden

VLM Engineer

Unlock employer Dubai, United Arab Emirates Posted: 29 Mar 2026

Financial

  • Estimate: $80k - $120k*
  • Zero income tax location

Accessibility

  • Office Only
  • Apply from abroad
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

About the Job: The company is a publicly funded research institute based in Abu Dhabi, United Arab Emirates. It is home to a diverse community of leading scientists, engineers, mathematicians, and researchers from across the globe, dedicated to transforming problems into pioneering research and technology prototypes that advance society. As part of the company’s Artificial Intelligence Research Center, the Extreme-Scale Language Model team is developing and implementing innovative deep learning technologies applicable in various fields, including Natural Language Processing, Perception, and Vision. The team is known for developing the Falcon models and is continuing its journey into cutting-edge applied research on large language models. Key Responsibilities: - Vision Model Ablation Studies: Conduct comprehensive ablation studies to assess the impact of various components and configurations on vision models. - Data Ablation Research: Perform data ablation studies to identify optimal data types for training vision-language models and analyze the impact of different data inputs on model performance. - Model Evaluation: Develop robust evaluation protocols for assessing the performance of vision-language models across diverse benchmarks and real-world scenarios. - Model Training and Optimization: Engage in model training, focusing on integrating large language models with vision models like CLIP. Technical Skills Required: - Expertise in machine learning, particularly with vision-language models and large language models (LLMs). - Strong understanding of model architectures, especially CLIP, and their application in vision-language tasks. - Proficiency in distributed training techniques and multi-GPU optimization. - Experience with deep learning frameworks (e.g., PyTorch). - Strong analytical skills for conducting ablation studies and evaluating model performance. - Familiarity with dataset curation and processing for vision and language tasks. Qualifications: - PhD in deep learning. - Proven track record of research and development in vision-language models. - Publication record in top-tier conferences is highly desirable. At the company, we help society to overcome its most significant hurdles through rigorous scientific inquiry and collaboration with leading international institutions. Our work focuses on groundbreaking advancements in AI, advanced materials, autonomous robotics, cryptography, digital security, directed energy, quantum computing, and secure systems.

Apply Direct

Jobs you might like   View all jobs

Ready to apply for this role?

Apply Direct