Post a Job

Senior Site Reliability Engineer

Unlock employer Riyadh, Saudi Arabia Posted: 20 Apr 2026

Apply Direct

Financial

Estimate: $80k - $100k*
Zero income tax location

Accessibility

Apply from abroad
Visa Provided

Requirements

Experience: Senior
English: Fluent

Explore similar roles:

View Site Reliability Engineer jobs in Riyadh · View all Site Reliability Engineer jobs

Position

The Cloud team at the company is currently seeking a Senior Site Reliability Engineer. In this position, the individual will be responsible for providing reliability to the cloud infrastructure that enabled the company's cloud-based services and internal systems.

Ready to apply for roles like this?

Unlock the company name and direct application link. Subscribers get instant access to fresh jobs across Dubai, Abu Dhabi and Riyadh, many with visa support.

Unlock employer & apply directly

Responsibilities

Provide Reliability Engineering to cloud services deployed and managed in the region of KSA.
Continuous delivery (CI/CD) using ArgoCD, Jenkins, Maven, and Docker.
Architect cloud systems in highly available design, ensuring Disaster Recovery (DR) measures are in place.
Containerization and deployment of microservices and data pipeline on Kubernetes using Helm installation.
Advocate for a DevOps culture of automation and engineering best practices to enable development teams.
Auto-scale and monitor performance for Kubernetes and running applications using Prometheus and Grafana or similar tools.
Performing SRE activities such as availability and reliability monitoring and reports.
Deploy, configure and maintain tools such as Kafka, Spark, Trino, Airflow, MQTT, and Microservices.
Set up infrastructure as a service using Terraform.
Work and deploy using the codebase repositories in GitLab, along with participation in the peer review activities.
Support No-SQL databases such as Elastic Search, Mongo, Cassandra, and other open-source services.
Set up and monitor various applications and services. Continuously enhance the alerts and automate the recovery process.
Participate in the on-call rotation to keep up the service SLA per the business needs.
Work with Product Owners, engineering managers, and other team members in Agile Scrum and Kanban mode.
Take appropriate actions by doing impact analysis during the incidents.

Requirements and Skills

B.S. or M.S. degree in Computer Science, Engineering, or equivalent field.
Can speak English fluently to communicate with teams across geographical regions.
+6 years of experience in SRE or DevOps Engineering.
+5 years of experience deploying and maintaining systems on Oracle cloud (OCI) and Amazon web services (AWS).
+3 years of experience in Terraform.
+3 years of experience in Kubernetes.
+3 years of experience with tools such as Jenkins, ArgoCD, etc. to build automation and CI/CD pipelines.
Professional experience in programming or scripting languages using Python, Go, Bash/Shell, or others.
Experience in administrative operations knowledge in RDBMS (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB).
Moderate experience in distributed computing for running data systems using Spark, Hive, Zookeeper, and Kafka.
Moderate experience in debugging tools and troubleshooting performance bottlenecks at the infrastructure or application tier.
Good to have experience with Config Management using Ansible, Chef, Puppet, or others.