Post a Job

Senior AI Research Engineer, Model Inference

Unlock employer Dubai, United Arab Emirates Posted: 12 Nov 2025

Apply Direct

Financial

Estimate: $90k - $120k*
Zero income tax location

Accessibility

Fully Remote
Apply from abroad
Visa Provided

Requirements

Experience: Senior
English: Professional

Explore similar roles:

View AI Research Scientist jobs in Dubai · View all AI Research Scientist jobs

Position

Join the company and shape the future of digital finance. At the company, we are not just building products; we are pioneering a global financial revolution. Our solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. We harness the power of blockchain technology to enable secure and instantaneous digital token transactions at a fraction of the cost, with transparency as the foundation of our operations.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

Responsibilities:

Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.
Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
Conduct evaluation and benchmarking, including perplexity testing and fine-tuned adapter performance.
Deliver production-grade, efficient language model deployment for mobile and edge use cases.
Collaborate with research and engineering teams to scale new model optimization methods.

Qualifications:

Proficiency in C++ and GPU kernel programming.
Proven expertise in GPU acceleration with the Vulkan framework.
Strong background in quantization and mixed-precision model optimization.
Experience in Vulkan compute shader development and customization.
Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon, etc.).
Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
Hands-on experience with mobile GPU acceleration and model inference.

Language Requirements:

Excellent English communication skills.

Important Information for Candidates:

Only apply through official channels.
Verify the recruiter’s identity through verified profiles.
Be cautious of unusual communication methods; interviews are conducted through official company emails.
We will never ask for payment or personal financial details during the hiring process.

This is your opportunity to collaborate with some of the brightest minds in the fintech space at the company, where innovation meets human potential.

Apply Direct