Featured Projects

A curated list of my AI, ML, and data engineering work.

Data Engineering

Car Sales Data Engineering Pipeline

Built an end-to-end ETL pipeline for automotive sales data using Azure Data Factory and Azure Databricks. Transformed the data with PySpark and stored the curated output as Delta tables for analytics.

Stack: Azure Data Factory, Azure Databricks, PySpark, Delta Lake, SQL

View on GitHub

Machine Learning

Amazon Prime Video Data Analysis & ML Insights

Performed exploratory data analysis (EDA) and visualization on an Amazon Prime Video dataset. Built ML models to analyze content distribution and trends on the platform. Includes charts, graphs, and model outputs.

Stack: Python, Jupyter Notebook, Pandas, Matplotlib, Seaborn, ML

View on LinkedIn

Applied AI

RAG-Driven PDF Intelligence Assistant

Built a Retrieval-Augmented Generation (RAG) system that ingests PDFs, generates embeddings, retrieves relevant passages with FAISS, and produces context-aware answers using LangChain and Groq LLMs.
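
For readers unfamiliar with RAG, the retrieval step can be sketched in a few lines of plain Python. This is an illustration only, not the project's code: the bag-of-words `embed` function is a toy stand-in for a Sentence Transformers model, the brute-force ranking in `retrieve` stands in for a FAISS index, and the function names, sample chunks, and prompt wording are all hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words vector (assumption: stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query (brute force, no index)."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query, context_chunks):
    """Assemble the text an LLM would receive: retrieved context, then the question."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


# Hypothetical text chunks, as if extracted from PDFs.
chunks = [
    "Delta Lake stores tables as Parquet files plus a transaction log.",
    "FAISS builds an index over dense vectors for similarity search.",
    "Seaborn is a statistical plotting library built on Matplotlib.",
]

query = "How does FAISS search vectors?"
top = retrieve(query, chunks, k=1)
prompt = build_prompt(query, top)
```

In a full system, the prompt built here would be sent to the LLM, and the embedding and search functions would be replaced by a trained embedding model and a vector index.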

Stack: Python, LangChain, FAISS, Groq LLM, Sentence Transformers

View on GitHub

Machine Learning

End-to-End ML Project

Built an end-to-end machine learning pipeline covering preprocessing, model training, evaluation, and experiment tracking, organized in a deployment-ready structure.

Stack: Python, Scikit-learn, Pandas, ML Pipeline, Azure ML

View on GitHub