Company logo hidden

Site Reliability / DevOps Engineer

Unlock employer Riyadh, Saudi Arabia Posted: 01 Aug 2024

Financial

  • Estimate: $70k - $120k*
  • Zero income tax location

Accessibility

  • Office Only
  • Visa Provided

Requirements

  • Experience: Unspecified

Position

About the Job
Site Reliability and DevOps Engineering is a key requirement within the Rackspace support infrastructure, providing a prominent level of technical support to customers via phone, ticketing systems, and automation. This role is responsible for managing complex customer issues that may take several days or weeks to resolve, maintaining ongoing communication to keep customers updated. The position provides a framework for system development, maintenance, and enhancements, while implementing standards and guidelines of Site Reliability Engineering (SRE) support.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

As a member of the SRE team, you will operate our private cloud and develop tools and integrations for a portfolio of cloud infrastructure services. Your expertise in private cloud will drive improvements in operations and releases through code, utilizing common open-source observability tools like ELK and Grafana for proactive alerts to measure and maintain Service Level Objectives. Collaboration with the Tier 1 team on escalations will offer opportunities for automation and process improvements.

This is an onsite position based in the Riyadh region of KSA.

Responsibilities

  • Develop and deliver software to build and enhance the functionality, reliability, availability, and manageability of applications and cloud platforms using a DevOps model for On-Prem environments (OpenStack and Kubernetes).
  • Automate development, testing, and deployment processes through CI/CD pipelines (Git, GitLab, Helm, ArgoCD) to support various architectures.
  • Work with Tier 1 support to handle system and customer escalations.
  • Troubleshoot and resolve infrastructure-related issues, including those associated with Kubernetes clusters, applications, workloads, and networks.
  • Collaborate with software engineering teams to optimize application performance and reliability.
  • Ensure the reliability and continuity of Kubernetes or Virtual Machine (VM) workloads.
  • Continuously evaluate and improve systems and processes to enhance reliability, performance, and efficiency.
  • Stay updated with industry trends, best practices, and emerging technologies in SRE and DevOps.

Role Requirements

  • Solid background in private cloud infrastructure (OpenStack, VMware, Kubernetes) with operational and troubleshooting experience.
  • Familiarity with the software development lifecycle including development, testing, packaging, deployment, upgrade, and support.
  • Direct experience in private cloud and Kubernetes resource development and operations, including knowledge of major OpenStack components (e.g., Keystone, Nova, Neutron, Glance) and Kubernetes components (e.g., CNI, CRI, CSI, control plane).
  • Proficiency in software development using Python, including the ability to write patches for OpenStack and workload manifests in YAML for Kubernetes.
  • Experience supporting software-defined storage solutions such as Ceph or other cloud-based storage systems, as well as Kubernetes Container Storage interfaces like Portworx, MicroCeph, and minIO.
  • Knowledge of hypervisor technologies, specifically KVM, and experience with operating systems such as Ubuntu, Red Hat Enterprise Linux, and CentOS.
  • Experience in building and maintaining code distribution through automated pipelines.
  • Proficient in developing cloud-native Kubernetes workloads and pipelines for delivery.
  • Familiarity with configuration management tools such as Ansible or Puppet.
  • Understanding of software-defined networking technologies, including OVS (Open vSwitch) and NFV.
  • Experience with Infrastructure as Code tools such as Terraform, Ansible, Git, GitLab, Helm, and ArgoCD.

This position offers an exciting opportunity to work within a dynamic team environment, driving improvements and providing exceptional service to customers in the cloud solutions space.

Apply Direct

Jobs you might like   View all jobs

About IT Services and IT Consulting Company

Company details are hidden. Subscribe to view full company profile.

Ready to apply for this role?

Apply Direct