About
Presight is an ADX-listed public company with Abu Dhabi-based G42 as its majority shareholder and is the region’s leading big data analytics company powered by GenAI. It combines big data, analytics, and AI expertise to serve every sector at every scale, creating both business and positive societal impact. Presight excels at all-source data interpretation to support insight-driven decision-making that shapes policy and fosters safer, healthier, happier, and more sustainable societies. Through its range of GenAI-driven products and solutions, Presight is enabling both private and public sectors to realize their AI strategies and ambitions faster.
The Opportunity
We are looking for an astute, proficient, and qualified Data Engineer to assess, analyze, and work with data concepts, use cases, and complex new data sources to provide business insights to customers and support the implementation and integration of these data sources into the Presight platform.
Responsibilities
- Develop high-performance data solutions and information products at Presight.
- Apply a solid software development background and strong Python coding skills to solve challenging problems.
- Engage in web crawling, data cleaning, data annotation, data ingestion, and data processing.
- Read and collate complex data sets.
- Create and maintain data pipelines.
- Focus continually on process improvement to drive efficiency and productivity within the team.
- Use Python, SQL, Shell, etc., to build the infrastructure required for optimal extraction, transformation, and loading of data.
- Provide insights into key business performance metrics by building analytical tools that utilize the data pipeline.
- Support the wider business with their data needs on an ad hoc basis.
- Comply with QHSE (Quality, Health, Safety, and Environment), Business Continuity, Information Security, Privacy, Risk, Compliance Management, and Governance of Organizations policies, procedures, plans, and related risk assessments.
Requirements
- Bachelor's degree in Computer Engineering, Computer Science, or Electrical Engineering and Computer Sciences.
- 3+ years of programming experience with solid coding skills in SQL, Python, Shell, Java, and, optionally, Scala.
- Understanding of data warehouse architecture, including ETL/ELT workflows and data pipeline design for structured and semi-structured data, with experience in modern solutions such as Snowflake, BigQuery, and Amazon Redshift.
- Proficient in batch data processing with tools like Apache Airflow or Luigi, and knowledgeable about real-time processing workflows and low-latency data system challenges.
- Expertise in writing efficient SQL queries, optimizing database performance, and using relational databases like Oracle, PostgreSQL, ClickHouse, and DuckDB, with knowledge of advanced SQL features such as window functions, CTEs, and partitioning.
- Familiarity with NoSQL databases such as Elasticsearch and Redis, and with document-oriented databases (e.g., MongoDB).
- Proficient in Python for data engineering with libraries like Pandas, PySpark, and Dask; experienced in big data and MPP frameworks such as Apache Spark, with skills in optimizing and debugging distributed data processing jobs.
- Understanding of data modeling principles, including dimensional modeling, slowly changing dimension (SCD) handling, surrogate key generation, and designing schemas for analytical workloads with complex hierarchies.
- Familiarity with BI tools like Power BI, Tableau, Apache Superset, or Looker. Experience in integrating BI tools with data warehouses or other data sources.
- Knowledge of best practices for data security, including encryption, access control, and IAM policies.
Ideal Candidate Profile
- Strong analytical and data visualization skills.
- In-depth market and domain knowledge.
- An innovative and creative approach to problem-solving.
- Excellent communication and presentation skills.
Work Conditions
- Culture: An open, diverse, and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.
- Career: Outstanding learning, development, and growth opportunities via structured training programs and innovative, high-tech projects.
- Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits, and more.