Cloud Observability Engineer

G42 Abu Dhabi, United Arab Emirates Posted: 23 May 2024

Financial

  • Salary unspecified
  • Zero income tax location

Accessibility

  • Hybrid
  • Apply from abroad
  • Relocation Support
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

Overview: With operations in 27 countries across 4 continents, M42 is at the epicenter of AI innovation in healthcare, driving the Emirati Genome Program, the world's largest population genome program. As part of a cross-functional and internationally trained team, you'll be working on cutting-edge AI projects that have a significant impact on global healthcare.

Responsibilities:

  • Design, implement, and maintain observability solutions on the Azure platform using tools such as Grafana, Azure Monitor, Application Insights, Azure Log Analytics, and open-source tools
  • Develop custom metrics, logs, and traces to monitor the health, performance, and availability of Azure resources, applications, and services
  • Configure and manage alerts, notifications, and dashboards to enable real-time monitoring and incident response
  • Monitor trends in system behavior over time to proactively address issues
  • Collaborate with software development teams to instrument code for logging, tracing, and telemetry collection
  • Troubleshoot and debug issues related to performance, availability, and scalability in Azure environments
  • Automate monitoring, logging, and alerting workflows using scripting languages and infrastructure-as-code tools
  • Identify areas for improvement and drive them end-to-end with stakeholders
  • Maintain deployment-specific SOPs and templates, and define supporting processes
  • Produce accurate and consistent engineering and architectural documentation, including high-level design (HLD) and low-level design (LLD) documents
  • Contribute to maintaining company values and culture by working collaboratively across cross-cultural teams
  • Occasionally provide weekend support as per project demand

Qualifications:

  • Minimum 5 years’ experience in cloud infrastructure monitoring and DevOps
  • In-depth knowledge of Unix/Linux systems, including system internals, file systems, and network protocols
  • Experience with monitoring tools (e.g., Prometheus, Grafana) and logging systems (e.g., ELK stack)
  • Ability to analyze system performance and plan for future capacity requirements
  • Proficiency in alerting tools such as Prometheus, Grafana, Nagios, or others
  • Working knowledge of Grafana, Power BI, SQL, scripting, Python, Azure monitoring, and ITIL processes
  • Experience in developing Continuous Integration/Continuous Delivery (CI/CD) pipelines
  • Strong background in Linux, Windows, storage, and network device administration
  • Hands-on knowledge of Docker and container orchestration tools such as Kubernetes, Rancher, and Docker Swarm
  • Vendor certification (Azure, VMware, or AWS) is mandatory

What we look for: If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. A bias for action and a passion to conquer new frontiers in the AI space are at the heart of the M42 community.

What working at M42 offers:

  • Culture: An open, diverse, and inclusive environment with a global vision that encourages personal growth and focuses on groundbreaking, industry-first innovations
  • Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects
  • Work-Life: A hybrid work policy to strike the perfect balance between office and home
  • Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more

About G42

A leading AI & Cloud Computing company based in Abu Dhabi, committed to inventing a better everyday through the power of people and technology.