About Me


Hey there! I’m Shivam, a Staff Data Scientist and AI Engineer at Walmart Claim Services—think of me as your friendly neighborhood AI tinkerer.

My passion for Artificial Intelligence, Deep Learning, Machine Learning, and NLP is only rivaled by my love for a good cup of coffee. With over a decade of experience across E-commerce, Finance, Supply Chain, and Product domains, I specialize in building ML/AI solutions from wild idea to real-world deployment. Whether it’s fine-tuning LLMs, orchestrating multi-cloud AI systems, or wrangling data into submission, I’m always up for a challenge (and a bit of fun along the way).

I’ve mentored 100+ aspiring data scientists and AI enthusiasts, and I’m always open to deep technical chats, collaborations, or just geeking out over the latest GenAI breakthroughs. My happy place? Experimenting with new models, building AI systems, and occasionally pretending my code never breaks.

When I’m not busy with RAGs, LLMs, or anomaly detection, you’ll find me writing on Medium and Substack (AI Core Loop), or reminiscing about my days at Amazon and Northeastern University. Want to connect, collaborate, or just swap AI jokes? Drop me a line at negi.sh@husky.neu.edu or find me on LinkedIn.

Current Role

Staff Data Scientist, Walmart Claim Services
Leading GenAI and data science integration, multi-cloud deployment, and advanced AI/ML model optimization (LLMs, RAG, risk detection, NER, prompt engineering, etc.).

Previous Experience

Amazon: Built ML models for supply chain optimization, led analytics for Amazon Prime/Fresh, and developed event-driven pipelines using AWS Lambda, SageMaker, S3, and more.

Academic Background

Masters in Information Systems, Northeastern University, Boston
Bachelors, VIT University, Vellore
United Nations Research: Analyzed migration patterns in Eastern Europe using advanced data science/statistics for public policy.

Let's collaborate! Open to research, mentoring, and impactful AI projects.
Feel welcome to connect for the latest on AI learning materials and opportunities.

Core Expertise


🤖 Artificial Intelligence 🧠 GenAI & LLMs 🔬 Deep Learning 💬 NLP ⚙️ ML Engineering ☁️ Cloud AI 📚 Research
🚀 AI/ML Frameworks

PyTorch, TensorFlow, HuggingFace, Scikit-learn, LangChain, OpenAI API

💾 Data & Analytics

Python, R, SQL, Spark, Pandas, NumPy, Jupyter

☁️ Cloud Platforms

AWS (Lambda, SageMaker), GCP (Vertex AI), Azure ML, Docker, Kubernetes

🛠️ Development

Git, CI/CD, REST APIs, FastAPI, Streamlit, MLOps, Model Deployment

Recent Publication

Leveraging Vision Transformers for Early Detection of Oral Cancer: A Deep Learning Approach to Medical Imaging

Published in Intelligent and Fuzzy Systems, Artificial Intelligence in Human-Centric, Resilient and Sustainable Industries, Proceedings of the INFUS 2025 Conference, Volume 3

Read on Springer

Enhancing Animal Shelter Operations with Time Series and Machine Learning

Published in SMU Data Science Review Journal

Read on SMU Scholar

Leveraging GenAI for Biometric Voice Print Authentication

Published in SMU Data Science Review

Read on SMU Scholar

More...

Featured Research & Projects


Location: Bentonville, USA

Featured Research & Projects

🔄 Adaptive RAG

Researching adaptive Retrieval-Augmented Generation (RAG) systems for dynamic knowledge integration and context-aware AI solutions.

🤖 Agentic Document Summarizer

Developing agent-based document summarization frameworks leveraging LLMs for multi-source, context-rich summarization.

📊 Trend & Anomalies Detection using GenAI

Building GenAI-powered systems for real-time trend analysis and anomaly detection in large-scale data streams.

🧠 Hierarchical Memory Management

Exploring hierarchical memory architectures for efficient long-context handling and retrieval in AI models.

Experience


  • 2023
    -
    Present

    Walmart Claim Services

    Staff Data Scientist

    • Leading GenAI and data science integration for organizational transformation.
    • Orchestrating multi-cloud deployment, auto-monitoring, and optimization of advanced AI/ML models.
    • Applied LLMs (GPT-4, Llama, FlanT5) to projects in topic modeling, RAG, summarization, risk detection, NER, and prompt engineering.
    • Driving innovation in AI-powered solutions for business impact.

    Skills: GenAI, LLMs, Cloud AI, ML Engineering, Python, AWS, GCP, Azure, MLOps

  • 2020
    -
    2023

    Amazon

    Data Scientist

    • Built end-to-end ML models for supply chain optimization and analytics for Amazon Prime and Fresh.
    • Developed event-driven pipelines using AWS Lambda, SageMaker, and S3.
    • Led analytics infrastructure, tracked 100+ KPIs, and delivered insights in a fast-paced, startup-like environment.
    • Focused on rapid prototyping, agile delivery, and cloud-based ML solutions.

    Skills: Machine Learning, AWS, Data Engineering, Analytics, Python, Agile

  • Jan 2018
    -
    April 2018

    Northeastern University Business School

    Reserach Assistant

    • Cleaned textual data (around one million records) and performed data munging by transforming and mapping different files provided by IRS
    • Performed text mining on financial data based files using SAS and Python using NLTK and Pandas

    Skills : R, SAS, Python, NLTK, Pandas, Google Cloud

  • July 2017
    -
    Dec 2017

    Boston Children's Hospital

    Data Analyst COOP

    • Followed agile development methodology on daily basis to perform data analysis and resolve support issues
    • Created interactive MicroStrategy dashboards, reports, visual user manuals and feature documentation of the developed software
    • Performed data masking by encrypting Protected Health Information(PHI) in SQL Database using AES_256 algorithm
    • Worked on specimen transaction databases and developed billing algorithm which may lead to revenue growth of about $10k
    • Performed database migration, integration and indexing from legacy third party database to in-house SQL database

    Skills : Data Analysis, Data Management, SQL, Microstrategy

  • Jan 2017
    -
    May 2017

    Northeastern University

    Reserach Assistant

    • Assisted Republic of Moldova and United Nations Development Program (UNDP) in population estimation through household energy consumption data
    • Examined spatial data using ArcGIS(Global Information System) to analyze, and understand patterns of the distribution of Spatial Data, and to establish its relationship with factors like economy, terrain and geographic location
    • Performed Linear Regression, Multi-variable Regression and other advanced data statistical predictive model to derive key econometric indicators to deduce shadow economy share in state's economy
    • Performed clustering analysis to identify the similarities between growth patterns, economies of post soviet states

    Skills : R, Python, Tableau, R Libraries (rpart, ggplot, Regression)

  • Sept 2015
    -
    May 2016

    Cognizant Technology Services

    Programmer Analyst

    • Worked closely with Finance and Banking business unit to gather business requirements and to prioritize tasks
    • Performed ETL to migrate data from multiple sources to designed ecommerce data warehouses using SSIS
    • Analyzed data and developed BI reports consisting of KPIs and Scorecards using Tableau to improve corporate profitability measures and comparative marketing, sales and financial analysis
    • Designed and developed database for a Web App. for the enterprise business unit
    • Researched and analyzed fields which are required for bank transactions, comprehensive data storage of the banks, current practices and compliance for improvising the design and usage
    • Created logical and physical database designs along with data structures, procedures and triggers for Hospital Management System using SQL server

    Skills : SQL, MySQL, SSIS, Talend, ETL, HTML, CSS, BootStrap

  • June 2014
    -
    July 2014

    Omne Agate

    Data Analyst Intern

    •Assisted the firm in identifying baseline power consumption, engineering based savings and energy usage
    • Applied logistic regression to analyze and predict household energy usage (over 6000 household units) in Indore
    • Conducted data preparation, and outlier detection using MS SQL server; built the model using R

    Skills : R, SQL, Regression, Clustering

Education


  • 2016
    -
    2018

    Northeastern University

    Master Degree in Information Systems

    The coursework has a blend of technical and management subjects that helps me in growing deeper understanding regarding Information System as a whole. With focus on data analytics, machine learning and visualization, the coursework has provided me modern-day skills to use technology to improvise a business process and leverage data in making the business better.

  • 2011
    -
    2015

    VIT University

    Bachelor Degree in Electronics and Communications

    Cources: Data Structures, C Programming