My Projects
From architecting high-throughput cloud platforms to implementing complex deep learning models, I specialize in building robust, data-intensive applications. Below is a selection of my work.
software
Cloud-Native ETL & Analytics Platform
Carnegie Mellon University
Feb 2025 - Apr 2025
AWS (EKS)Go (GoFiber)SparkScalaKubernetesRedisElastiCacheTerraform
- Engineered Go (GoFiber) microservices on Kubernetes that handled ~125K RPS with a P99 latency of 5ms.
- Built a cost-optimized ETL pipeline using Spark (Scala) to process 1TB of raw data from Azure Blob.
- Secured 3rd place out of 40 teams by designing a high-throughput, low-latency data platform on AWS.
software
Real-Time Stream Processing System
Carnegie Mellon University
April 2025
KafkaSamzaYARNJava
- Designed a stream-processing system using Kafka and Samza to join multiple real-time GPS data streams.
- Deployed the driver-matching application on a YARN cluster, ensuring high availability and low-latency processing.
ai ml
Generative Diffusion Models for Image Synthesis
Carnegie Mellon University
Sept 2024 - Dec 2024
PyTorchPythonU-NetVAEDDPM/DDIM
- Implemented Denoising Diffusion Probabilistic Models (DDPM) and a U-Net architecture from scratch in PyTorch for iterative image generation.
- Integrated Variational Autoencoders (VAE) and Classifier-Free Guidance (CFG) to enhance conditional generation, evaluating performance with FID scores.
software
Jira Sprint Analytics Dashboard
Personal Project
May - Jun 2025
JavaScriptHTML/CSSChart.jsJira REST API
- Developed a client-side web dashboard using vanilla JavaScript and Chart.js to visualize real-time sprint data from the Jira REST API.
- Provides teams with actionable insights into velocity, scope changes, and task completion trends to improve agile workflows.
software
Heterogeneous Cloud Storage System
Carnegie Mellon University
March 2025
MySQLMongoDBRedisTerraformAzureJava
- Designed a polyglot persistence solution for a social network application, using the best database for each specific data type.
- Utilized MySQL for structured data, MongoDB for flexible user profiles, and an in-memory Redis caching layer to accelerate queries.
ai ml
Attention-Based Speech Recognition Transformer
Carnegie Mellon University
Nov 2024
PyTorchPythonTransformersAttention Mechanisms
- Built an end-to-end, attention-based speech-to-text transformer from scratch.
- Implemented the full encoder-decoder architecture with multi-head self-attention and cross-attention mechanisms in PyTorch.
ai ml
Sequence-to-Sequence Phoneme Transcription
Carnegie Mellon University
Oct 2024
PyTorchPythonLSTMs/GRUsCTC Loss
- Developed a deep sequence-to-sequence model using LSTMs to map audio utterances to phoneme sequences.
- Implemented Connectionist Temporal Classification (CTC) loss to handle alignment challenges between variable-length audio and text.
ai ml
Human Facial Expression Classification
BITS Pilani
Feb 2021 - Apr 2021
PythonCNNsscikit-learnOpenCV
- Implemented and evaluated various CNN architectures for facial expression recognition.
- Developed strategies to handle class imbalances and overfitting to achieve competitive evaluation metrics on a public dataset.