Data Engineer with 8+ years in Marketing, Finance, Sales & Product Domain, skilled in crafting distrusted processing and streaming solutions that fuel analytics and enable precise decision-making. Proficient in Python, SQL RDMS, Spark, and AWS technologies—including Redshift, Snowflake, and Kinesis—I build efficient data pipelines and optimized warehouses to manage large-scale and real-time data flows. I lead teams to deliver end-to-end data systems, from designing robust architectures and DBT driven pipelines to shaping analytics-ready layers and dashboards with Tableau and QuickSight.
Skills
- Programming: Python, SQL, Spark,
- Streaming: Kinesis, Kafka, SQS,
- ETL: Git, Apache Airflow, Fivetran, Matillion, DBT,
- Data Visualization: Tableau, QuickSight
- Data Warehouse: Data Lake, Redshift, Snowflake, RDMS, MongoDB,
- Cloud: AWS, Lambda, Glue, S3, Athena, EMR, EC2,
- Data Modeling: Dimensional Modeling, Star Schema, Normalization Techniques
Experience
AUTODESK
Bengaluru, India
Title - Senior Data Engineer: Domain - Marketing | Sales | Finance
08/2023 - Present
-
Led team in developing a multi touch attribution (MTA) and full-funnel metrics platform from scratch, integrating it into the AWS marketing stack. Collaborated with stakeholders to define and refine requirements, using PySpark, Snowflake, ~90 DBT models to automate 5 TB of data.
-
Utilized Fivetran APIs to ingest digital campaign and Marketo app engagement data, processed it, and built analytics tables in Snowflake with DBT. Orchestrated via Airflow, reducing ingestion latency by 30% and AWS costs by 25%. Collaborated with a global team on scalable pipelines and mentored a junior engineer on technical growth and optimization.
-
Built near real-time data streaming solution with AWS Kinesis pulling data from the Marketo app, and managed orchestration using Apache Airflow & DBT Processed
AMAZON
Title - Business Intelligence Engineer: Domain - AWS Product | Sales | Finance
05/2022 - 08/2023
- Collaborated with AWS finance for WWSO’s MBR reporting, creating datasets and reports for headcount, sales, pipeline, forecasting, deals, and discounts using Redshift, Python, Tableau integrating S3 to Redshift with Lambda in process automating ~900 Hrs of manual work.
- Designed and implemented data warehousing solutions for EC2 compute product usage and billing using Amazon Redshift. Developed schema design, data ingestion, modeling, and transformation pipelines with Apache Airflow and internal ETL tools, integrating Kinesis for streaming data into S3 to reporting product analytics.
- Participate in our GIT code repository by collaborating.
Chubb
Bengaluru, India
Title - Data Engineer: Domain - Insurance | Finance
06/2021 - 05/2022
Contributed to setting up ETL automation, standard script logging and monitoring the status of jobs for historical analysis. Worked on performance optimization of existing & new API data extractions script to speed up execution time by 90%