Company logo hidden

Lead Site Reliability Engineer

Unlock employer Abu Dhabi, United Arab Emirates Posted: 08 Jul 2025

Financial

  • Estimate: $90k - $120k*
  • Zero income tax location

Accessibility

  • Office Only
  • Apply from abroad
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

Presight, an ADX-listed public company majority-owned by Abu Dhabi company G42, is a leading big data analytics company powered by Artificial Intelligence (AI). The company focuses on creating business and positive societal impact through its expertise in big data, analytics, and AI.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

The Opportunity:
Presight is seeking a meticulous and expert Lead Engineer - Site Reliability to build and support a delivery model that empowers product and technology teams. This role involves developing and delivering high-quality products, improving platform infrastructure, and strengthening the reliability of products and solutions. The Lead Site Reliability Engineer will play a key role in defining and establishing the delivery model for next-generation analytics solutions and services.

Responsibilities:

  • Drive reliability, performance, and scalability across the infrastructure in partnership with stakeholders.
  • Own the SRE roadmap and guide implementation through mentorship and hands-on work.
  • Functional architect and lead reliability strategies across services and environments.
  • Define and enforce SLOs, SLIs, and error budgets with engineering leadership.
  • Lead incident response and perform root cause analysis.
  • Implement automation to reduce toil and improve system resilience.
  • Manage capacity planning, traffic forecasting, and cost optimization.
  • Mentor junior and senior SREs in technical and process excellence.
  • Collaborate with MLOPS, DevSecOps, and CloudOps teams to enforce best practices.
  • Champion observability, metrics-driven decisions, and platform maturity.
  • Deploy monitoring tools such as Prometheus and Grafana to track system performance.
  • Ensure system reliability adheres to security and compliance standards.

Qualifications:

  • Bachelor's Degree in Computer Engineering or a related field.
  • Minimum 10 years of experience in site reliability with 2 years in people management.
  • Expertise in Kubernetes, CI/CD (e.g., GitLab), and infrastructure-as-code (Terraform/Helm).
  • Strong experience in cloud platforms (Azure, AWS, or GCP).
  • Experience with multi-tenant systems or high-throughput data platforms.
  • Exposure to AI/ML infrastructure or MLOps pipelines.
  • Proven background in SRE principles, SLIs/SLOs, and reliability-focused engineering.
  • Programming proficiency in Python or Shell (nice to have).
  • Deep understanding of distributed systems, networking, and incident management.
  • A highly detail-oriented and methodical approach to problem solving.
  • Strong analytical mindset, with excellent verbal and written communication skills.

What We Look For:
Join Presight for a culture of innovation, outstanding career growth opportunities, and competitive rewards. We welcome candidates eager to advance in AI and thrive in a dynamic environment.

What Working At Presight Offers:

  • Culture: An open, diverse, and inclusive environment with a global vision.
  • Career: High-impact projects with resources for continuous growth and learning.
  • Rewards: Competitive remuneration package with benefits including healthcare, education support, and leave benefits.
Apply Direct

Jobs you might like   View all jobs

Ready to apply for this role?

Apply Direct