I build large-scale data platforms that power billion-dollar decisions. I currently own tax data pipelines processing 45B+ records/month across US and global markets at Amazon Fintech.
I am a Senior Data Engineer with 9+ years of hands-on experience building highly scalable, distributed big data applications. I design enterprise-grade platforms end to end, from Bronze-Silver-Gold pipeline architectures to real-time ETL monitoring, with deep expertise in PySpark, Airflow, and AWS (EMR, Redshift, S3).
Previously, I built the Data Science Enablement Platform (DEEP) at Optum/UnitedHealth Group, processing 5+ TB daily across a 100 TB cluster with 750+ nodes. At TCS, I created the ExaLogs analytics platform — an org-wide monitoring dashboard that won the Applause Award.
34 Airflow DAGs managing 45B+ records across global markets
71% infrastructure cost reduction through AWS-native migration
54% Redshift query latency reduction via archival framework
Amazon India (Fintech)
Gurugram, Haryana
Project: Fintech TDW • Team Size: 11
To The New Pvt Ltd
Noida, Uttar Pradesh
Optum Global Solutions (UnitedHealth Group)
Noida, Uttar Pradesh
Project: Data Science Enablement Platform (DEEP) • Team Size: 24
Tata Consultancy Services Limited
Thane, Maharashtra
Project: TCS Analytics (ExaLogs) • Team Size: 15 • Role: Hadoop Developer
G.L. Bajaj Institute of Technology and Management
Happy to chat about data engineering, pipeline architecture, or opportunities.