Hello, I'm Sanket Muchhala

AI Engineer

Passionate about building scalable AI solutions using generative AI, LLMs, and NLP. Experienced in designing agentic systems, document intelligence workflows, and ML pipelines deployed at scale.

About Me

AI/ML Engineer with 3+ years of experience building scalable solutions using generative AI, LLMs, and NLP. Skilled in designing agentic systems, document intelligence workflows, and ML pipelines deployed at scale.

I specialize in driving automation and data-backed insights across industries such as insurance, esports, and enterprise analytics. My expertise covers the full lifecycle of AI systems, from research and development through production deployment.

Education

Master of Science in Data Science

Indiana University Bloomington, USA | Aug 2022 – May 2024

Bachelor of Technology in Information Technology

Thakur College of Engineering and Technology, India | Aug 2018 – May 2022

3+ Years Experience
10+ AI Projects Deployed
5 Industries Impacted

Technical Skills

Programming Languages

Python, SQL, R, JavaScript

AI/ML Frameworks & Tools

Scikit-learn, TensorFlow, PyTorch, FastAPI, MLflow, spaCy

Generative AI & LLMs

GPT-4, LangChain, RAG, Agentic AI, Vector DBs (FAISS)

NLP & Document Intelligence

NER, Text Classification, Summarization, Sentiment Analysis

Data Engineering & Storage

Pandas, NumPy, PySpark, AWS (SQS, Step Functions), Azure Data Lake, Azure SQL

Visualization & BI

Tableau, Power BI, R Shiny

Professional Experience

AI Engineer

Progressive Insurance | May 2024 – Present
AI-Powered Claims Automation & Risk Analysis (ACARA)
  • Engineered custom NLP and CV models using TensorFlow and PyTorch to process and classify claim-related texts, forms, and images
  • Developed NER and sentiment analysis tools using BERT models via Hugging Face Transformers to extract key entities from claim documents
  • Built ML pipelines with Apache Airflow and Azure Data Factory to preprocess and stream data from on-prem SQL Server and Azure Data Lake
  • Deployed models to production using Azure ML Services and managed versioning with MLflow and DVC
  • Designed RESTful APIs using FastAPI to integrate predictive models into the core claims processing system
  • Containerized model services with Docker and orchestrated deployment via Kubernetes on AKS
  • Achieved 35% reduction in manual claim processing time and improved fraud detection accuracy by 25%

Research Assistant – Generative AI

Indiana University Bloomington | Dec 2023 – May 2024
  • Improved transcript accuracy by 18 percentage points using a GPT-4 RAG pipeline deployed on BigRed200, processing over 200 hours of esports videos
  • Reduced chat feature latency by 40% with a GPT-4 sentiment analysis microservice, processing 1M+ messages in near real-time
  • Automated retraining pipelines using SLURM on HPC systems, cutting manual ETL effort by 6 hours per match
  • Documented GenAI workflows that were adopted by two graduate cohorts for ongoing esports psychology research

Data Analyst

IBM | Sep 2020 – Jun 2022
  • Led end-to-end development of a churn prediction model using Python and Scikit-learn, driving a 20% reduction in customer attrition
  • Refactored ETL workflows using Azure Data Lake and SQL, improving data availability and cutting processing time by 15%
  • Built automated data validation pipelines with SQL and Python, raising dashboard reporting accuracy by 18%
  • Deployed ML models to Azure ML environments with CI/CD support, accelerating release cycles by 25%
  • Introduced versioning standards for ML pipelines and datasets, increasing transparency in model updates and audits

Featured Projects

AI Study Buddy (Developing)

An AI-powered learning assistant that optimizes study schedules, makes adaptive learning recommendations, and coaches interview preparation using scientifically proven techniques.

React, TypeScript, AI Algorithms, TailwindCSS

AI Job Application Agent (Developing)

A job application automation tool that uses DeepSeek's AI API for semantic field matching, contextual response generation, and form analysis, all at very low cost (~$0.14 per 1M tokens).

Python, DeepSeek AI, MCP Protocol, Claude Desktop

Location-Based File Sharing System

Engineered a serverless AWS solution using S3 and Lambda with geospatial filtering, maintaining 99% uptime with efficient access control. Integrated OpenStreetMap via Leaflet.js into a responsive JavaScript frontend.

AWS S3, Lambda, Leaflet.js, JavaScript

Latest Blog Posts

AI vs Human Brain
Sanket Muchhala | August 03, 2025 | AI/Philosophy

A deep dive into the fundamental differences between artificial intelligence and human cognition, exploring what AI really is, how it works, and whether machines can truly think.

Certifications

Professional credentials that validate my expertise in AI/ML and cloud technologies

AWS Certified Machine Learning Engineer - Associate (Verified)

Advanced AWS ML services and best practices for production ML systems

Amazon Web Services | 2024

Azure Artificial Intelligence Fundamentals (AI-900) (Verified)

Microsoft Azure AI services and machine learning fundamentals

Microsoft | 2024

Databricks Generative AI Foundations (Verified)

Comprehensive understanding of generative AI and LLM applications

Databricks | 2024

Google Data Analytics Professional Certificate (Verified)

Comprehensive data analysis skills using Google tools and methodologies

Google | 2021

Get In Touch

Let's Connect

I'm always interested in discussing new opportunities, innovative projects, or just connecting with fellow AI/ML enthusiasts. Feel free to reach out!