
AI Infrastructure Engineer

Dubai, United Arab Emirates
Posted: 30 Jun 2026

Financial

Estimate: $80k - $120k*
Zero income tax location

Accessibility

Fully Remote
Apply from abroad
No Visa Provided

Requirements

Experience: Intermediate
English: Professional

Position

About the Role
You will own the inference backbone behind the company's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role centers on engineering quality at the runtime level, including startup behavior, memory pressure, the throughput/latency balance, and long-session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low-level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on-device AI experiences and helps set the technical foundation for the company's next generation of peer-to-peer AI products.

About the Job
You'll work on the C++ layer that powers local AI, porting and enhancing inference engines such as llama.cpp to run efficiently on edge devices. Your focus is on the runtime: making models load faster, run leaner, and perform well across different hardware. You'll ensure that the inference layer is stable, optimized, and ready for integration with the rest of the stack. This role is for engineers who want to work close to the metal, enabling private and fast on-device AI without relying on cloud infrastructure.

Responsibilities

Deploy machine learning models to edge devices using llama.cpp and ggml
Collaborate closely with researchers to assist in coding, training, and transitioning models from research to production environments
Integrate AI features into existing products, enriching them with the latest advancements in machine learning

Requirements

Excellent programming skills in C++; experience with JavaScript is a bonus
Strong experience with the llama.cpp and ggml inference engines, including deploying models to specific GPU architectures
Good understanding of deep learning concepts and model architectures
Experience with transformers, LLMs, and diffusion models
Demonstrated ability to rapidly assimilate new technologies and techniques
A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D

Location
United Arab Emirates, Dubai
