Lead Site Reliability Engineer

Presight | Abu Dhabi, United Arab Emirates | Posted: 02 Jul 2025

Financial

  • Estimate: $90k - $120k*
  • Zero income tax location

Accessibility

  • Office Only
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

About the Job: Presight is seeking a meticulous, expert Lead Engineer - Site Reliability to build and support the Presight delivery model. The role enables product and technology teams to develop and deliver high-quality products, enhance platform infrastructure, and strengthen the reliability of solutions. The Lead Engineer will play a crucial role in defining and establishing the delivery model for cutting-edge analytics solutions and services.

About Presight: Presight is a leading big data analytics company in the region, powered by Artificial Intelligence (AI) and majority-owned by Abu Dhabi company G42. The company applies its expertise in big data, analytics, and AI to serve organizations of every sector and scale, aiming to create positive business and societal impact. Presight's advanced computer vision, AI, and omni-analytics platform supports insight-driven decision-making that shapes policy for safer, healthier, and more sustainable societies.

Responsibilities:

  • Work collaboratively with stakeholders to drive reliability, performance, and scalability across infrastructure.
  • Own the SRE roadmap and guide implementation through mentorship, code contributions, and hands-on work.
  • Partner with Engineering, Data Science, and Product teams to embed reliability into the development lifecycle.
  • Define and enforce SLOs, SLIs, and error budgets with engineering leadership.
  • Lead incident response and root cause analysis.
  • Implement automation to reduce toil and improve system resilience.
  • Manage capacity planning, traffic forecasting, and cost optimization.
  • Mentor junior and senior SREs in technical and process excellence.
  • Collaborate with MLOps, DevSecOps, and CloudOps teams to enforce best practices.
  • Champion observability, metrics-driven decisions, and platform maturity.
  • Deploy monitoring tools such as Prometheus and Grafana to track system performance.
  • Ensure system reliability adheres to security and compliance standards.
  • Comply with QHSE (Quality, Health, Safety, and Environment), Business Continuity, Information Security, Privacy, Risk, Compliance Management, and Governance policies.

Qualifications:

  • Required Skills:

    • Bachelor's Degree in Computer Engineering or related field.
    • Minimum 10 years of experience in site reliability, with 2 years in people management.
    • Expertise in Kubernetes, CI/CD (e.g., GitLab), and infrastructure-as-code (Terraform/Helm).
    • Strong experience in cloud (Azure, AWS, or GCP).
    • Experience with multi-tenant systems or high-throughput data platforms.
    • Exposure to AI/ML infrastructure or MLOps pipelines.
    • Proven background in SRE principles, SLIs/SLOs, and reliability-focused engineering.
    • Programming proficiency in Python or Shell (nice to have).
    • Deep understanding of distributed systems, networking, and incident management.
  • Personal Traits:

    • Highly detail-oriented and methodical approach to problem-solving.
    • Passion for technology, troubleshooting, and customer service.
    • Strong analytical mind.
    • Excellent verbal and written communication skills.

What We Offer:

  • Culture: An open, diverse, and inclusive environment focused on innovation and personal growth.
  • Career Growth: Opportunities for accelerated career growth through high-impact projects and resources for continuous learning.
  • Rewards: Competitive remuneration package with various perks, including healthcare, education support, and leave benefits.

If you are eager to excel in AI and thrive in a dynamic environment, we welcome you to join our community at Presight.

About Presight

Presight, an ADX-listed public company limited by shares whose majority shareholder is Abu Dhabi company G42, is the region’s leading big data analytics company powered by Artificial Intelligence (“AI”). We combine big data, analytics, and AI expertise to serve every sector, at every scale, to create positive business and societal impact. With our world-class computer vision, AI, and omni-analytics platform as our engine, we excel at all-source data interpretation to support insight-driven decision-making that shapes policy and creates safer, healthier, happier, and more sustainable societies.