Post a Job

Reinforcement Learning Engineer

Unlock employer Dubai, United Arab Emirates Posted: 09 Oct 2025

Apply Direct

Financial

Estimate: $80k - $120k*
Zero income tax location

Accessibility

Office Only
Apply from abroad
Visa Provided

Requirements

Experience: Intermediate
English: Professional

Explore similar roles:

View Data Scientist jobs in Dubai · View all Data Scientist jobs

Position

Technology Innovation Institute (TII) is a publicly funded research institute based in Abu Dhabi, United Arab Emirates, dedicated to transforming challenges into pioneering research and technology prototypes to advance society. This role falls within TII's Robotics Research Center.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

We are seeking a talented Reinforcement Learning (RL) Engineer with expertise in developing and deploying RL solutions for robotics, swarm intelligence, and drone systems. The ideal candidate will have a strong foundation in both theoretical RL and practical implementation of algorithms in real-world environments. You will design novel RL architectures, integrate advanced methodologies, and build scalable systems capable of handling complex distributed control problems.

Key Responsibilities:

RL Algorithm Development & Integration: Design, implement, and optimize RL algorithms for robotic platforms, UAV swarms, and autonomous agents; integrate and implement RL solutions for long-horizon planning and decision-making.
Multi-Agent Reinforcement Learning (MARL): Build and evaluate MARL frameworks for coordination, deconfliction, and cooperative decision-making in multi-drone systems.
Engineering & Deployment: Implement efficient training pipelines for large-scale RL simulations; optimize performance in simulation-to-real transfer for robotics and aerial vehicles.
Research & Innovation: Stay up to date with state-of-the-art RL methodologies; investigate hybrid learning paradigms (e.g., neurosymbolic methods, model-based/model-free hybrids).

Core Competencies:

Reinforcement Learning Expertise: Strong understanding of policy-gradient methods, Q-learning, actor-critic frameworks, and hierarchical RL; hands-on experience with MARL, federated learning, centralized vs decentralized control, and memory-augmented policies.
Development Tools & Libraries: Proficient in RL frameworks such as Ray RLlib, Stable Baselines3, and simulation environments like PyBullet, Isaac Gym, Gazebo, MuJoCo, AirSim; AI frameworks: PyTorch, TensorFlow, JAX.
Programming Skills: Proficient in Python and C++ for RL research, prototyping, experimentation, middleware integration (e.g., ROS2), and real-time control.
Systems & Infrastructure: Proficiency with Docker, distributed training systems, and GPU clusters; familiarity with CUDA and large-scale simulation pipelines; experience deploying RL models in robotics middleware (ROS2, PX4, MAVSDK).

Qualifications:

Master’s or PhD in Computer Science, Robotics, AI/ML, or a related field.
Proven track record of implementing RL algorithms for robotics or UAV applications.
Strong expertise in multi-agent systems, swarm robotics, and real-world control.
Experience bridging simulation and real-world deployment.
Excellent problem-solving ability and research-driven mindset.

Preferred Qualifications:

Experience with safety-aware or constrained RL for critical systems.
Background in distributed optimization, graph-based learning, or networked systems.
Contributions to open-source RL or robotics frameworks.
Publications in AI/robotics conferences.

At TII, we help society overcome its biggest challenges through rigorous scientific discovery and inquiry, utilizing state-of-the-art facilities and collaboration with leading international institutions.

Apply Direct