ML Engineer | Building Scalable AI Systems
ML + Backend Engineer pursuing an MS in Computer Science at UMass Amherst. Currently a Graduate Student Researcher at Allen Institute. Former Software Engineer at Microsoft on the Bing Ads team, building ML models for ad ranking and optimization serving millions of users globally.
Expertise in multi-agent LLM systems, production ML pipelines, privacy-preserving AI, and model evaluation. Research focus on LLM alignment, bias detection, and context engineering. Passionate about building reliable, scalable AI systems applying research to production, using LLMs, along with AI Security.
End-to-end identity verification system using Fireworks AI for document understanding. Automated extraction from passports and driver's licenses with >95% field accuracy. Includes classification, validation, and complete audit trail for regulatory compliance.
Vision LLM-powered system navigating web apps (Notion, Asana) via screenshot analysis and DOM parsing. Modular framework with multi-LLM provider support, hybrid state detection, and automated dataset generation.
Shopping recommendation system using Differential Privacy and Federated Learning. Validated against Membership Inference Attacks, misalignment, jailbreaking, and prompt injection attacks on LLMs.
Multimodal (voice + text) AI companion running entirely on-device. Implemented federated learning for personalized adaptation to tone and mood without data leaving the device.
GPT-4o + Google Vision API pipeline for document classification, OCR, and unstructured-to-structured conversion. 99% time reduction in document processing.
Document retrieval pipeline with segmentation, pseudo-query generation, and LLM fine-tuning. 4% precision improvement over BM25 baseline.
LangChain-powered Q&A bot that extracts answers from PDF and JSON documents. Automated question-answer pair generation with context-aware responses using OpenAI API.
Statistical analysis of pronoun and occupational biases in LLMs. Published research with mitigation strategies. Submitted to COLM'25
Apache Kafka streaming pipeline processing thousands of events/second. Neural network models predict repository trends for proactive discovery.
Intelligent bot for conversation summarization and Q&A from Slack channel history, improving team productivity.
Multi-agent embodied AI simulation platform built with Unity and Python. Generated benchmarks for speed and resolution across robots, agents, and scenes. Accepted at ICLR 2026
Comprehensive analysis of pronoun and occupational biases in LLMs using statistical tests. GPT-4o: <5% non-preferred pronoun selection; Qwen2.5: 100% positional bias. Submitted to COLM'25
Context Engineering • Post-Training & Alignment • LLM Inference • Multi-Modal LLMs • Multi-Agent Systems • Bias Detection & Mitigation • Privacy-Preserving ML
Working on semantic-driven language model pre-training, exploring novel approaches to improve language understanding through semantic representations and structured knowledge integration.
Built multi-agent AI pipeline for NL-to-SQL generation with RAG-based feedback loop, achieving 98% success rate. Designed comprehensive evaluation framework for assessing query accuracy and system reliability.
Built and sold enterprise healthcare document processing platform to medical insurance company. Architected end-to-end AI pipeline using GPT-4o and Vision API for document classification and OCR, reducing processing time by 99% and enabling thousands of daily document processing with high accuracy.
Developed ML models for crop identification and health monitoring with 96% accuracy. Built forecasting algorithms using ARIMA and Prophet for environmental insights and crop recommendations.
Analyzed seasonality patterns in advertiser campaigns using time-series modeling. Optimized bid pricing algorithms for peak holiday periods.
Microsoft (FY20-21) — Pipeline migration project improving operational efficiency
MIT Hacks 2025 (Judge) • HackHarvard 2024 (Mentor) • She Hacks DTU 2021 (Mentor)
Grace Hopper Celebration 2025 — Application review across multiple verticals
Proposed interpretable solution for SpO2 measurement error analysis
Inspire Scholarship from Government of India (2015)