Senior Data Engineer (12 months contract)

Presight | Abu Dhabi, United Arab Emirates | Posted: 05 Dec 2024

Financial

  • Estimate: $80k - $120k*
  • Zero income tax location

Accessibility

  • Apply from abroad
  • Visa Provided

Requirements

  • Experience: Senior
  • English: Professional

Position

About
Presight, an ADX-listed public company limited by shares whose majority shareholder is Abu Dhabi company G42, is the region’s leading big data analytics company powered by Artificial Intelligence (“AI”). It combines big data, analytics, and AI expertise to serve every sector, of every scale, to create business and positive societal impact. With its world-class computer vision, AI, and omni-analytics platform as its engine, Presight leverages all-source data to support insight-driven decision-making that shapes policy and creates safer, healthier, happier, and more sustainable societies.

The Opportunity
Presight is looking for an astute, proficient, and qualified Senior Data Engineer to assess, analyze, and work with data concepts, use cases, and complex new data sources in order to provide business insights to customers and to support the implementation and integration of those data sources into the Presight AI platform.

Responsibilities

  • Solve challenging problems using Python coding skills.
  • Design, build and launch new data extraction, transformation & loading processes in production.
  • Perform web crawling, data cleaning, data annotation, data ingestion, and data processing.
  • Read and collate complex data sets.
  • Create and maintain data pipelines.
  • Maintain a continual focus on process improvement to drive efficiency and productivity within the team.
  • Use Python, SQL, Elasticsearch (ES), Shell, etc. to build the infrastructure required for optimal extraction, transformation, and loading of data.
  • Provide insights into key business performance metrics by building analytical tools that utilize the data pipeline.
  • Support the wider business with their data needs on an ad hoc basis.
  • Comply with QHSE (Quality Health Safety and Environment), Business Continuity, Information Security, Privacy, Risk, Compliance Management, and Governance of Organizations policies, procedures, plans, and related risk assessments.

Requirements

  • Bachelor's degree in Computer Engineering, Computer Science, or Electrical Engineering and Computer Sciences.
  • 6+ years of programming experience, with solid coding skills in Python, Shell, and Java.
  • Experience with web crawling and data cleaning.
  • In-depth knowledge of the design and implementation of Spark jobs to execute, schedule, monitor, and control processes.
  • Experienced in Spark SQL and PostgreSQL query languages.
  • Skilled in writing complex queries with joins for processing large datasets.
  • Proficient in using containerization technologies such as Docker.
  • Experienced with orchestration tools like Kubernetes.
  • Adept at implementing testing and monitoring systems for data pipelines to ensure high availability and reliability.
  • Experience with tools like Apache Kafka and Apache Flink for real-time data processing.
  • Skilled in using data orchestration tools like Apache Airflow and Apache NiFi.
  • Strong understanding of Elasticsearch architecture, queries, and ingestion techniques.
  • Experience with solution architecture, data ingestion, query optimization, data segregation, ETL, ELT, AWS services (EC2, S3, SQS, Lambda, Redshift), Elasticsearch, CI/CD frameworks, and workflows.
  • Working knowledge of data platform concepts: data lake, data warehouse, ETL, big data processing (designing for and supporting variety, velocity, and volume), real-time processing architecture for data platforms, and scheduling and monitoring of ETL/ELT jobs.
  • Proficient in PostgreSQL and programming (preferably Java or Python), with a solid understanding of data, entity relationships, structured and unstructured data, and SQL and NoSQL databases.
  • Knowledge of best practices in optimizing columnar and distributed data processing systems and infrastructure.
  • Experienced in designing and implementing dimensional modeling.
  • Knowledge of machine learning and data mining techniques in one or more areas of statistical modeling, text mining, and information retrieval.
  • Strong analytical and problem-solving skills.

What we look for
If you have a performance-driven, inquisitive mind and the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. A bias for action and a passion to conquer new frontiers in the AI space are at the heart of the Presight community.

What working at Presight offers

  • Culture: An open, diverse, and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.
  • Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.
  • Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits, and more.