About the Job
The Lead Systems Engineer - Computing Technology co-designs, leads the implementation of, and provides Level 3 expert support for large-scale private cloud and/or HPC infrastructure, focusing on computing technologies including the hardware layer, operating system, hypervisor, and orchestration services.
Location
Abu Dhabi, ARE
Responsibilities
- Co-design, lead implementation, and manage hybrid virtualization and containerized platforms based on OpenStack, VMware VCF, and/or Red Hat OpenShift, ensuring platform stability, performance, and compliance with industry standards and best practices.
- Define and oversee the implementation of the roadmap for all virtualization and HPC platforms across the company.
- Collaborate with architecture and engineering teams on technology stack component evaluation and selection, ensuring solutions are designed following best practices and optimized from both functional and non-functional perspectives.
- Lead regular capacity planning exercises to accommodate the growing demands on the virtualized environment and HPC infrastructure.
- Develop and oversee plans to enhance the reliability of the computing infrastructure, ensuring high availability of services.
- Lead regular performance assessments and implement improvements based on findings.
- Define and oversee execution of disaster recovery strategies ensuring system integrity and availability.
- Design and enhance the observability stack in collaboration with the infrastructure operations team.
- Provide L3 expert support, including on-call shifts, and act as the final tier of resolution for L2 support teams.
- Mentor a team of engineers and collaborate with other infrastructure engineering teams on solution design and delivery.
- Work closely with security management teams to ensure systems are secured against cybersecurity threats.
- Write and maintain relevant documentation, ensuring quality and completeness.
- Participate in the hiring process by conducting technical interviews.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Engineering, Software Engineering, or a related field.
- 2+ years of experience leading a team of 3+ engineers in infrastructure projects.
- 7+ years of deep expertise in designing, implementing, and managing private cloud stacks with a focus on compute and virtualization technologies.
- Extensive hands-on experience with platforms such as OpenStack, VMware VCF, and Red Hat OpenShift.
- 7+ years of hands-on experience in Linux environments and 3+ years in senior systems or infrastructure engineering roles.
- Profound understanding of hardware architecture and components.
- Good understanding of network and storage types and architecture.
- Experience in managing large-scale public or private cloud environments is highly desirable.
- Advanced programming and scripting skills using Python and/or Golang.
- Experience with monitoring and observability tools such as Zabbix, Grafana, and the ELK Stack.
- Understanding of CI/CD principles, the Infrastructure as Code (IaC) approach, and software-defined infrastructure solutions.
- Strong organizational skills with the ability to multitask and prioritize.
- A proactive approach to problem-solving and decision-making.
This job offers an exciting opportunity to contribute to cutting-edge cloud computing technologies while leading a talented team of engineers.