Work Experience

Download Resume

Data Engineering Lead (CMU Capstone)

eParts Services LLC | Pittsburgh, PA

January 2025 – Present

  • Led a 6-member team to design and implement a cloud-native multi-tenant data lakehouse on Snowflake with Medallion architecture, designed for cloud-vendor portability and extensible data integration, replacing legacy SQL-based reporting and reducing production load by 30%.
  • Engineered a Java microservice to expose ML-ready versioned datasets via multi-tenant RESTful APIs with Auth0/OIDC integration for security, enabling scalable feature engineering and model training
  • Architected a fault-tolerant CDC pipeline with Kafka and Debezium, enabling real-time ingestion at scale.
  • Automated cloud infrastructure deployment on Kubernetes (AKS) using Terraform and Helm, establishing a reproducible environment enabling rapid onboarding of new tenants in less than 3 hours (~75% reduction).

Key Technologies: Snowflake, Kafka, Debezium, Azure AKS, Terraform, Helm, Kubernetes, Java (Spring Boot)

Software Engineer

Silicon Labs | Hyderabad, India

July 2021 – May 2024

  • Architected the backend infrastructure for a high-throughput distributed IoT logging system (IOT Core, Kafka, Java microservices, TimescaleDB), ingesting 100k+ logs/min, enabling real-time telemetry visibility for QA and internal BI.
  • Owned design and implementation of a Java microservice (Spring Boot) for idempotent Kafka-based log ingestion with batched writes to a managed PostgreSQL/TimescaleDB backend and DLQ routing, achieving 99.9 % availability SLO.
  • Developed a GraphQL API powering an internal logging dashboard, enabling QA and PMs to access and analyze device log metrics independently and reducing triage time by 50%.
  • Optimized TimescaleDB schema with time-based partitioning and continuous aggregates, reducing query latency p95 by 40% for QA analytics and common query patterns (time-range, firmware version, severity filters).
  • Integrated AWS CloudWatch data with Grafana to monitor health metrics, reducing high-severity incidents by 30%.
  • Collaborated cross-functionally withs stakeholders (5 teams) to standardize telemetry schemas reducing MTTR by 20%.
  • Mentored 5+ junior engineers and interns through code/design reviews, created onboarding materials, and contributed to hiring by conducting technical interviews.
  • Won Silicon Labs internal Technical Symposium for innovations in logging optimization and test automation.

Key Technologies: C (SDK), Java (Spring Boot), Kafka (MSK), TimescaleDB (PostgreSQL extension), GraphQL, Redis, React, AWS IoT Core, Kubernetes (EKS), EC2, Grafana

Data Solutions Intern

BNY Mellon | Pune, India (Remote)

August 2020 – December 2020

  • Automated Python/SQL-based ETL workflows for 15 client-facing market reports, reducing manual effort by approximately 10 hours per week per report.
  • Developed robust data validation and anomaly detection mechanisms within the pipelines to significantly improve accuracy and reduce errors in final reports.

Key Technologies: Python (Pandas, NumPy), SQL, Pentaho, Toad, MS Excel

Intern

Indian Meteorological Department | Pune, India

May 2019 – July 2019

  • Developed an IoT-based automation system using an Arduino microcontroller for remote weather monitoring.
  • Designed the system to upload data every few seconds, dramatically increasing data collection frequency compared to the previous hours-long manual process.

Key Technologies: Arduino (C++), GPRS Modules, Weather Sensors