As the leading delivery platform in the region, talabat has a unique responsibility and opportunity to positively impact millions of customers, restaurant partners, and riders. To achieve our mission, we must scale and continuously evolve our machine learning capabilities, including cutting-edge Generative AI (genAI) initiatives. This demands robust, efficient, and scalable ML platforms that empower our teams to rapidly develop, deploy, and operate intelligent systems.
As an ML Platform Engineer, your mission is to design, build, and enhance the infrastructure and tooling that accelerates the development, deployment, and monitoring of traditional ML and genAI models at scale. You’ll collaborate closely with data scientists, ML engineers, genAI specialists, and product teams to deliver seamless ML workflows—from experimentation to production serving—ensuring operational excellence across our ML and genAI systems.
Responsibilities:
- Design, build, and maintain scalable, reusable, and reliable ML platforms and tooling that support the entire ML lifecycle, including data ingestion, model training, evaluation, deployment, and monitoring for both traditional and generative AI models.
- Develop standardized ML workflows and templates using MLflow and other platforms, enabling rapid experimentation and deployment cycles.
- Implement robust CI/CD pipelines, Docker containerization, model registries, and experiment tracking to support reproducibility, scalability, and governance in ML and genAI.
- Collaborate closely with genAI experts to integrate and optimize genAI technologies, including transformers, embeddings, vector databases, and real-time retrieval-augmented generation (RAG) systems.
- Automate and streamline ML and genAI model training, inference, deployment, and versioning workflows, ensuring consistency, reliability, and adherence to industry best practices.
- Ensure reliability, observability, and scalability of production ML and genAI workloads by implementing comprehensive monitoring, alerting, and continuous performance evaluation.
- Integrate infrastructure components such as real-time model serving frameworks, Kubernetes orchestration, and cloud solutions for robust production environments.
- Drive infrastructure optimization for generative AI use cases, including efficient inference techniques.
- Partner with data engineering, product, infrastructure, and genAI teams to align ML platform initiatives with broader company goals.
- Contribute actively to internal documentation, onboarding, and training programs to promote platform adoption and continuous improvement.
Requirements:
- Bachelor’s degree in Computer Science, Engineering, or a related field; advanced degree is a plus.
- 3+ years of experience in ML platform engineering, ML infrastructure, generative AI, or closely related roles.
- Strong software engineering background with experience in building distributed systems or platforms designed for machine learning and AI workloads.
- Expert-level proficiency in Python and familiarity with ML frameworks and infrastructure tooling.
- Experience implementing modern MLOps practices, including model lifecycle management and infrastructure-as-code tools.
- Proven experience with generative AI technologies.
- Familiarity with SQL and data warehouse modeling; capable of managing complex data queries, joins, aggregations, and transformations.
- Strategic mindset with strong problem-solving skills and effective technical decision-making abilities.
- Excellent communication and collaboration skills, comfortable working cross-functionally across diverse teams and stakeholders.
- Strong sense of ownership, accountability, and proactive bias for action.
Language Requirements:
- Proficiency in English is typically expected.
Company Overview:
Since launching in Kuwait in 2004, talabat has been a leading on-demand food and Q-commerce app, offering convenience and reliability to its customers across eight countries. We harness innovative technology to simplify everyday life for our customers and optimize operations for our restaurant and local shop partners. At talabat, we are committed to building a high-performance culture through an engaged workforce, allowing us to spread positive vibes.