Hello! I'm

SAHIL
VACHHANI

MS in Data Science @ University at Buffalo.
2× intern across NLP and ML. I build things that turn messy data into decisions — pipelines, models, and dashboards that actually get used.


MS Graduate

DATA
SCIENTIST

Python · SQL · NLP · LLMs · AWS · Power BI.
Open to full-time roles in Data Science, AI/ML Engineering, Data Engineering and Analytics.

About

TURNING DATA
INTO DECISIONS
THAT MATTER

3.5M+
Data Points Processed
50K+
Records Analyzed
95%
ML Model Accuracy
2
Internships

Data doesn't tell stories on its own; I build the systems that make it speak. From NLP pipelines that cut manual review effort by 30–40% to ML models analyzing 3.5M+ taxi trips in near real time, I focus on work that ships and scales. I recently graduated with an MS in Data Science from the University at Buffalo (GPA: 3.56), with hands-on experience across Python, SQL, AWS, LLMs, and BI tools. I'm now looking for full-time opportunities where I can bring both technical depth and product thinking to the table.

Python · SQL · R · scikit-learn · LightGBM · NLP · LLMs · AWS · Docker · Power BI · Tableau · MLflow · FastAPI · Spark · Kafka · Snowflake · Feature Engineering · Time Series · A/B Testing · ETL / ELT · Data Modeling
Work

SELECTED PROJECTS

NLP · LLMs 2025
ObitBuddy

End-to-end obituary publishing platform using LLMs and NLP to automate content validation. Reduced manual review effort by 30–40% across 100+ records. Deployed in a live pilot to 50+ stakeholders.

Data Engineering · BI 2025
Supply Chain Optimization Dashboard

Structured 50K+ logistics, inventory & marketing records with SQL & Python. Designed 10+ KPIs (on-time delivery, inventory turnover, LTV) in an interactive Power BI dashboard with what-if simulations.

AWS · ML · Streaming 2024
Rideflow Analytics

ML pipeline analyzing 3.5M+ NYC taxi trips to forecast hourly demand. Automated ETL on AWS cut processing time by 40%. A normalized schema and tuned SQL indexes reduced query latency by 25%.

NLP · MLflow · LLMs 2025
Sentiment Wars

LLM vs ML sentiment analysis pipeline on 50K+ text samples. Fine-tuned DeBERTa-v3-base to 93.5% accuracy, outperforming DistilBERT and XGBoost by 8%. MLflow tracking for reproducibility.

ML · Snowflake 2024
Ride Prediction Analysis

End-to-end ride-demand prediction pipeline on Snowflake with ETL and LightGBM forecasting. Interactive dashboards comparing actual vs predicted rides across key NYC zones. Optimized SQL for 25% faster queries.

AI · JavaScript 2024
Snap2Vibe AI Music Recommender

AI-powered music recommendation app that analyzes images to detect mood and suggests matching tracks. Combines computer vision with music discovery for a unique user experience.

Experience

WHERE I'VE WORKED

Aug 2025 – Dec 2025 · Present
APPLIED NLP INTERN
Media Sales Plus Inc. · Buffalo, NY

Applied NLP techniques for text validation, content structuring, and workflow optimization in product features.

Implemented LLM-driven pipelines including prompt design and evaluation to enhance model reliability.

Conducted Python-based data analysis and supported backend/API integrations for end-to-end functionality.

Jan 2024 – Apr 2024
ML & AI INTERN
Dinjan Infotech Pvt. Ltd. · Surat, India

Built ML solutions for credit card approvals, cancer detection, and Uber analytics, achieving 90–95% accuracy.

Deployed Telegram + LLM chatbots (OpenAI/Gemini) that automated insights and cut manual analysis time by 35%.

Built a web-scraping and NLP pipeline that collected 10K records to streamline text analysis and reporting.

Education

WHERE I'VE STUDIED

Aug 2024 – Dec 2025
MS IN DATA SCIENCE
University at Buffalo, The State University of New York · Buffalo, NY
GPA: 3.56 / 4.00
Relevant Coursework
Statistical Learning · Applied ML at Scale · MLOps · Data Models & Query Languages · Data Visualization · Probability & Statistics · Programming & Database Fundamentals
Aug 2020 – Jun 2024
BE IN COMPUTER ENGINEERING
Gujarat Technological University · Ahmedabad, India
CGPA: 7.93 / 10.00
Relevant Coursework
Database Management Systems · Artificial Intelligence · Big Data Analytics · Cloud Computing · Statistical Analysis

LET'S BUILD SOMETHING SHARP.

sahilsubhasbhaivachhani@gmail.com · Buffalo, New York