About the Job
The Senior Systems Engineer engages in the design, leads implementation, and provides Level 3 expert support for large-scale private cloud environments, with a specific emphasis on PaaS and SaaS applications and services, their interoperability, and integration with underlying infrastructure components.
Ready to apply for roles like this?
Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.
Unlock employer & apply directly
Core42 is the UAE’s national-scale enabler for cloud and generative AI, combining expertise across multiple technology disciplines into a single platform for public sector and large enterprise transformations. Building on capabilities as a sovereign cloud and HPC specialist, the company brings generative AI, cybersecurity, and professional managed services expertise to enable national-scale program deployments across industries.
Responsibilities
- Co-design, implement, and manage applications and services for hybrid virtualization and containerized platforms based on OpenStack, VMware VCF, and/or Red Hat OpenShift, ensuring platform stability, performance, and compliance with industry standards and best practices.
- Collaborate with architecture and engineering teams on technology stack component evaluation and selection, ensuring solutions are designed following best practices and optimized from both functional and non-functional perspectives.
- Develop and implement plans to enhance the reliability of the applications and services infrastructure, addressing potential points of failure and ensuring high availability of services.
- Conduct regular performance assessments and implement improvements based on findings.
- Prepare and participate in complex changes to production environments supporting operational teams.
- Develop auto-test and automation solutions for cloud platforms using tools like Jenkins and Selenium, along with other configuration management tools such as Terraform, Ansible, Puppet, Chef, and GitLab CI/CD.
- Provide L3 expert support including on-call shifts focused on immediate incident management and resolutions, such as outages, breaches, and system failures.
- Write and maintain relevant documentation ensuring completeness and quality.
- Prepare and conduct training for operational teams in related technical domains.
- Collaborate with security management teams to ensure systems are secure against cybersecurity threats.
- Work closely with process management and operational teams to contribute to process development, standardizing collaboration frameworks, and improving collaboration efficiency.
Qualifications
To qualify for the role, you must have a Bachelor’s or Master’s degree in Computer Science, Engineering, Software Engineering, or another relevant technology field, along with:
- 7+ years of hands-on experience in Linux environments and 5+ years in a senior systems engineering role.
- Experience in designing, deploying, and managing Kubernetes and/or OpenShift clusters with a deep understanding of Kubernetes architecture and ecosystem.
- Familiarity with virtualization technologies like OpenStack and/or VMware, and computing technologies such as x86 hardware, OS, KVM/ESXi.
- In-depth knowledge of frontend, application, and middleware technologies such as Apache, Nginx, Kafka, and RabbitMQ.
- Experience with deploying scalable solutions supporting enterprise-level applications.
- Proficiency in IAM protocols such as SAML, OAuth 2.0, and OpenID Connect, and experience with secure single sign-on (SSO) implementations.
- Understanding of CI/CD principles, Infrastructure as Code (IaC) approaches, and experience with tools such as Terraform and Ansible.
- Proficiency in scripting languages like Python or Bash for automation.
- Solid understanding of security practices and tools, including experience with security scanning tools and implementation of best practices.
- Experience with database management and optimization for both SQL and NoSQL databases.
- Knowledge of monitoring and observability tools like Prometheus and Grafana.
- Ability to design and implement disaster recovery and high availability strategies.
- Practical knowledge of network protocols (TCP/IP, HTTP, SSL/TLS) and network security measures.
- Strong project management skills with experience in agile methodologies and tools such as JIRA.
- Relevant certifications are highly desirable.
What We Look For
We seek a performance-driven individual with a proactive approach to problem-solving and decision-making. An eagerness to build meaningful collaborations and a passion for exploring new frontiers in the AI space are essential traits that align with Core42's community.
What Working at Core42 Offers
- Culture: An open, diverse, and inclusive environment focused on groundbreaking innovations.
- Career: Opportunities for learning and development through structured training programs and innovative projects.
- Work-Life: A hybrid work policy that balances office and home life.
- Rewards: A competitive remuneration package that includes healthcare, education support, and generous leave benefits.
If you believe you meet the above qualifications, please reach out to us.