Work Experience
Download ResumeData Engineering Lead (CMU Capstone)
eParts Services LLC | Pittsburgh, PA
January 2025 – Present
- Led a 6-member team to design and implement a cloud-native multi-tenant data lakehouse on Snowflake with Medallion architecture, designed for cloud-vendor portability and extensible data integration, replacing legacy SQL-based reporting and reducing production load by 30%.
- Engineered a Java microservice to expose ML-ready versioned datasets via multi-tenant RESTful APIs with Auth0/OIDC integration for security, enabling scalable feature engineering and model training
- Architected a fault-tolerant CDC pipeline with Kafka and Debezium, enabling real-time ingestion at scale.
- Automated cloud infrastructure deployment on Kubernetes (AKS) using Terraform and Helm, establishing a reproducible environment enabling rapid onboarding of new tenants in less than 3 hours (~75% reduction).
Key Technologies: Snowflake, Kafka, Debezium, Azure AKS, Terraform, Helm, Kubernetes, Java (Spring Boot)
Software Engineer
Silicon Labs | Hyderabad, India
July 2021 – May 2024
- Architected the backend infrastructure for a high-throughput distributed IoT logging system (IOT Core, Kafka, Java microservices, TimescaleDB), ingesting 100k+ logs/min, enabling real-time telemetry visibility for QA and internal BI.
- Owned design and implementation of a Java microservice (Spring Boot) for idempotent Kafka-based log ingestion with batched writes to a managed PostgreSQL/TimescaleDB backend and DLQ routing, achieving 99.9 % availability SLO.
- Developed a GraphQL API powering an internal logging dashboard, enabling QA and PMs to access and analyze device log metrics independently and reducing triage time by 50%.
- Optimized TimescaleDB schema with time-based partitioning and continuous aggregates, reducing query latency p95 by 40% for QA analytics and common query patterns (time-range, firmware version, severity filters).
- Integrated AWS CloudWatch data with Grafana to monitor health metrics, reducing high-severity incidents by 30%.
- Collaborated cross-functionally withs stakeholders (5 teams) to standardize telemetry schemas reducing MTTR by 20%.
- Mentored 5+ junior engineers and interns through code/design reviews, created onboarding materials, and contributed to hiring by conducting technical interviews.
- Won Silicon Labs internal Technical Symposium for innovations in logging optimization and test automation.
Key Technologies: C (SDK), Java (Spring Boot), Kafka (MSK), TimescaleDB (PostgreSQL extension), GraphQL, Redis, React, AWS IoT Core, Kubernetes (EKS), EC2, Grafana
Data Solutions Intern
BNY Mellon | Pune, India (Remote)
August 2020 – December 2020
- Automated Python/SQL-based ETL workflows for 15 client-facing market reports, reducing manual effort by approximately 10 hours per week per report.
- Developed robust data validation and anomaly detection mechanisms within the pipelines to significantly improve accuracy and reduce errors in final reports.
Key Technologies: Python (Pandas, NumPy), SQL, Pentaho, Toad, MS Excel
Intern
Indian Meteorological Department | Pune, India
May 2019 – July 2019
- Developed an IoT-based automation system using an Arduino microcontroller for remote weather monitoring.
- Designed the system to upload data every few seconds, dramatically increasing data collection frequency compared to the previous hours-long manual process.
Key Technologies: Arduino (C++), GPRS Modules, Weather Sensors