Rackspace Technology Circular Logo

Site Reliability / DevOps Engineer

Rackspace Technology Riyadh, Saudi Arabia Posted: 01 Aug 2024

Financial

  • Estimate: $70k - $120k*
  • Zero income tax location

Accessibility

  • Office Only
  • Visa Provided

Requirements

  • Experience: Unspecified
  • English: Professional

Position

About the Job
Site Reliability and DevOps Engineering is a key requirement within the Rackspace support infrastructure, providing a prominent level of technical support to customers via phone, ticketing systems, and automation. This role is responsible for managing complex customer issues that may take several days or weeks to resolve, maintaining ongoing communication to keep customers updated. The position provides a framework for system development, maintenance, and enhancements, while implementing standards and guidelines of Site Reliability Engineering (SRE) support.

As a member of the SRE team, you will operate our private cloud and develop tools and integrations for a portfolio of cloud infrastructure services. Your expertise in private cloud will drive improvements in operations and releases through code, utilizing common open-source observability tools like ELK and Grafana for proactive alerts to measure and maintain Service Level Objectives. Collaboration with the Tier 1 team on escalations will offer opportunities for automation and process improvements.

This is an onsite position based in the Riyadh region of KSA.

Responsibilities

  • Develop and deliver software to build and enhance the functionality, reliability, availability, and manageability of applications and cloud platforms using a DevOps model for On-Prem environments (OpenStack and Kubernetes).
  • Automate development, testing, and deployment processes through CI/CD pipelines (Git, GitLab, Helm, ArgoCD) to support various architectures.
  • Work with Tier 1 support to handle system and customer escalations.
  • Troubleshoot and resolve infrastructure-related issues, including those associated with Kubernetes clusters, applications, workloads, and networks.
  • Collaborate with software engineering teams to optimize application performance and reliability.
  • Ensure the reliability and continuity of Kubernetes or Virtual Machine (VM) workloads.
  • Continuously evaluate and improve systems and processes to enhance reliability, performance, and efficiency.
  • Stay updated with industry trends, best practices, and emerging technologies in SRE and DevOps.

Role Requirements

  • Solid background in private cloud infrastructure (OpenStack, VMware, Kubernetes) with operational and troubleshooting experience.
  • Familiarity with the software development lifecycle including development, testing, packaging, deployment, upgrade, and support.
  • Direct experience in private cloud and Kubernetes resource development and operations, including knowledge of major OpenStack components (e.g., Keystone, Nova, Neutron, Glance) and Kubernetes components (e.g., CNI, CRI, CSI, control plane).
  • Proficiency in software development using Python, including the ability to write patches for OpenStack and workload manifests in YAML for Kubernetes.
  • Experience supporting software-defined storage solutions such as Ceph or other cloud-based storage systems, as well as Kubernetes Container Storage interfaces like Portworx, MicroCeph, and minIO.
  • Knowledge of hypervisor technologies, specifically KVM, and experience with operating systems such as Ubuntu, Red Hat Enterprise Linux, and CentOS.
  • Experience in building and maintaining code distribution through automated pipelines.
  • Proficient in developing cloud-native Kubernetes workloads and pipelines for delivery.
  • Familiarity with configuration management tools such as Ansible or Puppet.
  • Understanding of software-defined networking technologies, including OVS (Open vSwitch) and NFV.
  • Experience with Infrastructure as Code tools such as Terraform, Ansible, Git, GitLab, Helm, and ArgoCD.

This position offers an exciting opportunity to work within a dynamic team environment, driving improvements and providing exceptional service to customers in the cloud solutions space.

Apply now

Jobs you might like   View all jobs

About Rackspace Technology

Be ready for what’s next with multicloud solutions from Rackspace Technology™. We are the multicloud solutions experts. We know what you're up against because we've helped global companies across industries through it — and you can bet we're ready to help. Our team delivers results by listening to your challenges and building custom cloud services and solutions that help your business perform better now and into the future.