Company logo hidden

AI Infrastructure Engineer

Unlock employer Dubai, United Arab Emirates Posted: 30 Jun 2026

Financial

  • Estimate: $80k - $120k*
  • Zero income tax location

Accessibility

  • Fully Remote
  • Apply from abroad
  • No Visa Provided

Requirements

  • Experience: Intermediate
  • English: Professional

Position

About the Role
You will own the inference backbone behind the company's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory pressure, throughput/latency balance, and long-session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low-level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on-device AI experiences and helps set the technical foundation for the company's next generation of peer-to-peer AI products.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

About the Job
You'll work on the C++ layer that powers local AI, porting and enhancing inference engines like llama.cpp or similar, to run efficiently on edge devices. Your focus is on the runtime: making models load faster, run leaner, and perform well across different hardware. You'll ensure that the inference layer is stable, optimized, and ready for integration with the rest of the stack. This role is for engineers who want to work close to the metal, enabling private and fast on-device AI without relying on cloud infrastructure.

Responsibilities

  • Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml
  • Collaborate closely with researchers to assist in coding, training, and transitioning models from research to production environments
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning

Requirements

  • Excellent programming skills in C++, experience in Javascript is a bonus
  • Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures
  • Good understanding of deep learning concepts and model architectures
  • Experience with transformers, LLMs, Diffusion models
  • Demonstrated ability to rapidly assimilate new technologies and techniques
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D

Location
United Arab Emirates, Dubai

Apply Direct

Jobs you might like   View all jobs

Ready to apply for this role?

Apply Direct