Hitesh Kumar Sahoo

Senior Data Engineer
Pune, IN.

About

Highly accomplished Senior Data Engineer with 10+ years of expertise in data engineering, ETL, and BI solutions, specializing in building scalable analytics-ready platforms. Microsoft and Snowflake certified, I leverage Dimensional Modeling and Medallion architecture to enhance data quality and accelerate decision-making, particularly within the Oil & Gas and Healthcare sectors. My leadership in client-facing projects drives significant cost savings, operational efficiency, and cross-functional data collaboration.

Work

NTT DATA
|

Senior Data Engineer

Pune, Maharashtra, India

Summary

Led data engineering initiatives for ExxonMobil's Supply Chain Analytics, driving cloud adoption and optimizing data workflows across global teams.

Highlights

Spearheaded a global team of 7-8 junior data engineers across India, US, and Brazil, leading ExxonMobil's Supply Chain Analytics initiatives and fostering team development.

Directed UAT phases onsite in Brazil and Vietnam, successfully migrating legacy on-prem databases to Snowflake, achieving seamless cloud adoption and enhancing client confidence.

Engineered robust data pipelines using Azure Data Factory to ingest diverse structured and semi-structured data (CSV, JSON, Parquet) into Fabric Lakehouse, leveraging PySpark and Python/Pandas for transformations to create curated Delta tables in the Silver layer of a Medallion architecture.

Architected Gold layer models within the Data Warehouse using dimensional modeling (Star and Snowflake schemas) on curated Delta tables, implementing partitioning, clustering, and performance tuning to optimize BI tool connections.

Optimized long-standing Snowflake data flow code, reducing runtime and compute usage by 49.4% and achieving monthly cost savings of ~$3,631 USD, earning executive recognition.

Enabled secure data sharing across partner teams, eliminating data pipeline and duplication efforts, thereby enhancing cross-business data collaboration and efficiency.

Developed and automated a complex email notification system for Snowflake task success/failure using Snowpark, Azure Data Factory, Azure Data Lake Storage, and Logic Apps, improving operational visibility.

Enhanced data delivery capabilities by implementing advanced Snowflake features including Snowpipe, Stream, Time Travel, Cloning, Sharing, Tasks, Materialized Views, External Tables, Procedures, and CDC.

NTT DATA (formerly Hashmap)
|

Senior Data Analyst | BI Lead

Pune, Maharashtra, India

Summary

Led a global BI team for Petronas Malaysia, delivering enterprise reporting solutions and optimizing data analytics processes.

Highlights

Directed a global Business Intelligence team of 9-10 members across US, Malaysia, and India, successfully delivering comprehensive enterprise reporting solutions for Petronas Malaysia.

Developed and optimized data marts and semantic models within the BI Analytics layer, significantly improving performance and usability of reporting solutions.

Automated the conversion of all manual Business Excel reports to Spotfire using Alteryx ETL pipelines while onsite in Malaysia, reducing usage time and effort by 70%.

Improved dashboard performance by integrating Common Table Expressions (CTEs), Python, and R for complex data manipulations.

Reduced report execution time by implementing Row-Level Security (RLS), caching mechanisms, and parameterized inputs.

Successfully migrated over 90 developed reports from Azure to Google Cloud Platform, ensuring seamless project completion and handover.

IQVIA (Formerly Quintiles)
|

Software Engineer 1

Bangalore, Karnataka, India

Summary

Developed and maintained ETL workflows and BI solutions for healthcare analytics, contributing to data-driven insights.

Highlights

Assisted senior engineers in adopting new technologies, progressively taking ownership of ETL workflows and Business Intelligence solutions.

Developed efficient ETL and database solutions leveraging SQL and PL/SQL Stored Procedures for data processing.

Designed, delivered, and maintained interactive BI dashboards using Spotfire and Power BI, integrating multiple data sources via IronPython, R, and JavaScript APIs.

Education

Trident Academy of Technology
Bhubaneswar, Odisha, India

B.Tech

Computer Science & Engineering

Grade: 7.5 CGPA

Languages

English

Certificates

Microsoft Certified: Fabric Data Engineer Associate

Issued By

Microsoft

Snowflake SnowPro Core Certification

Issued By

Snowflake

Tibco Certified Professional and Associate (Spotfire)

Issued By

Tibco

Alteryx Designer Core Certified

Issued By

Alteryx

Data Science for Engineer in R by NPTEL

Issued By

IIT Madras / NPTEL

Skills

Cloud & Data Engineering

Snowflake, Microsoft Fabric, Azure Cloud, Dimensional Modeling, GitHub, Data Warehousing, ETL, Medallion Architecture, Data Security, CI/CD, Azure DevOps, Cloud Migration, Data Governance, Data Lakehouse, Delta Tables.

Programming & Scripting

SQL, Python, Pandas, PySpark, JavaScript, PL/SQL, R Programming, Snowpark.

Data Analytics & BI Tools

Alteryx, Qlik Replicate, dbt, Spotfire, Tableau, PowerBI.

Leadership & Project Management

Team Leadership, Project Management, Client Management, Stakeholder Engagement, UAT Coordination, Cross-functional Collaboration.