Lead Systems Engineer

G42 | Abu Dhabi, United Arab Emirates | Posted: 05 Sep 2024

Financial

  • Estimate: $150k - $200k*
  • Zero income tax location

Accessibility

  • Hybrid
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

About the Job
The Lead Systems Engineer - Storage Infrastructure contributes to the design, leads the implementation, and provides Level 3 expert support for extra-large-scale storage infrastructure, ensuring the highest levels of performance, scalability, and reliability. The role is a key subject matter expert position, leading a team of engineers responsible for block, object, or file storage solutions and backup services.

Company Overview
G42 is an Abu Dhabi-based artificial intelligence and cloud computing company with a global footprint, delivering holistic and scalable AI solutions to a variety of commercial and government clients. The Group’s business operations cover multiple industry verticals including Healthcare, Government, Smart City & Smart Mobility, Oil & Gas, Fintech, Geospatial, Aviation, and Big Data Analytics.

Responsibilities

  • Co-design, lead the implementation of, and manage petabyte-scale block, object, and file storage solutions as integral components of Cloud and HPC environments, ensuring stability, performance, and compliance with industry standards and best practices.
  • Define and oversee the implementation of a roadmap for all storage solutions and services across the company.
  • Collaborate with architecture and other engineering teams on storage and backup technology component evaluation and selection, ensuring solutions are designed following best practices and are optimized from both functional and non-functional perspectives.
  • Lead regular capacity planning exercises to anticipate and accommodate the growing demands on the storage infrastructure, ensuring it meets current and future requirements.
  • Develop and oversee plans to enhance the reliability of the storage infrastructure, addressing potential points of failure and ensuring high availability of storage services.
  • Explore, analyze, and implement performance optimization strategies for the storage solutions, ensuring optimal resource utilization and performance.
  • Lead the evaluation and integration of advanced storage technologies and methodologies, such as software-defined storage (SDS), to enhance features, performance, and efficiency.
  • Define and oversee the execution of disaster recovery strategies, ensuring data integrity, availability, and protection across all platforms and environments.
  • Design and enhance the observability stack in collaboration with the IaaS operations team, ensuring monitoring coverage and accuracy.
  • Provide L3 expert support, including on-call shifts, and serve as the final tier of resolution for L2 support teams through problem analysis and communication with vendor technical support.
  • Lead and mentor a team of storage engineers and collaborate with other platform engineering teams on solution design and delivery.
  • Collaborate with security management teams to ensure that systems are safe and secure against cybersecurity threats.
  • Write and maintain relevant documentation, ensuring completeness and quality.
  • Work closely with process management and operational teams and contribute to process development, standardizing collaboration frameworks and improving collaboration efficiency.

Qualifications

  • Bachelor’s or master’s degree in computer science, engineering, software engineering, or a related field in technology.
  • 2+ years of experience leading a team of 3+ engineers, with accountability for the quality and timely delivery of infrastructure projects.
  • 7+ years of experience with deep expertise in designing, implementing, and managing large-scale software-defined storage (SDS) solutions providing block, object, or file storage services and backup capabilities.
  • In-depth hands-on experience in system implementation, management, and optimization of storage systems from leading vendors, including but not limited to HPE, Dell, NetApp, Hitachi, IBM, PureStorage, or VAST Data.
  • Deep knowledge of different storage protocols providing block, object, and file storage interfaces, such as iSCSI, S3, NFS, FC/FCoE, NVMe over TCP, etc.
  • Proficiency with Linux, the Linux kernel, and the storage stack, and the ability to debug related issues.
  • Advanced experience in managing object storage solutions based on SeaweedFS, MinIO, Cloudian HyperStore, Qumulo S3, Scality RING, or Dell ECS.
  • Experience with cloud-native backup solutions for OpenStack is highly desirable.
  • Experience in designing and managing clustered/parallel file systems such as Lustre, GPFS, etc. is highly desirable.
  • Familiarity with containerization technologies (OpenShift, Docker, Kubernetes) and container storage technologies (Rook, CSI, PVC).
  • Familiarity with integration of identity management, access management, and authorization solutions (PKI, LDAP, OAuth, OpenID).
  • In-depth knowledge of backup systems, disaster recovery principles, and data protection strategies.
  • Knowledge of load balancer technologies for object storage solutions.
  • Solid knowledge of data center network design and related technologies (OSI model, TCP/IP stack, firewalling, routing, VLAN/VXLAN).
  • Hands-on experience with monitoring and observability tools like Zabbix, Nagios, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana).
  • Understanding of CI/CD principles, the Infrastructure as Code (IaC) approach, and software-defined infrastructure solutions.
  • Advanced programming and scripting skills using Python and/or Golang, and Bash.

What We Look For
We seek performance-driven, inquisitive minds with the agility to adapt to ambiguity. Candidates should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. A passion for conquering new frontiers in the AI space is at the heart of our community.

What We Offer

  • Culture: An open, diverse, and inclusive environment focused on groundbreaking, industry-first innovations.
  • Career: Outstanding learning, development, and growth opportunities via structured training programs and innovative, high-tech projects.
  • Work-Life: A hybrid work policy to strike the perfect balance between office and home.
  • Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits, and more.

If you can confidently demonstrate that you meet the criteria above, please contact us as soon as possible.

About G42

A leading AI & Cloud Computing company based in Abu Dhabi, committed to inventing a better everyday through the power of people and technology.