
My passion for Artificial Intelligence, Deep Learning, Machine Learning, and NLP is only rivaled by my love for a good cup of coffee.
With over a decade of experience across E-commerce, Finance, Supply Chain, and Product domains, I specialize in building ML/AI solutions from wild idea to real-world deployment. Whether it’s fine-tuning LLMs, orchestrating multi-cloud AI systems, or wrangling data into submission, I’m always up for a challenge (and a bit of fun along the way).
I’ve mentored 100+ aspiring data scientists and AI enthusiasts, and I’m always open to deep technical chats, collaborations, or just geeking out over the latest GenAI breakthroughs. My happy place? Experimenting with new models, building AI systems, and occasionally pretending my code never breaks.
When I’m not busy with RAGs, LLMs, or anomaly detection, you’ll find me writing on Medium and Substack (AI Core Loop), or reminiscing about my days at Amazon and Northeastern University. Want to connect, collaborate, or just swap AI jokes? Drop me a line at negi.sh@husky.neu.edu or find me on LinkedIn.
Staff Data Scientist, Walmart Claim Services
Leading GenAI and data science integration, multi-cloud deployment, and advanced AI/ML model optimization (LLMs, RAG, risk detection, NER, prompt engineering, etc.).
Amazon: Built ML models for supply chain optimization, led analytics for Amazon Prime/Fresh, and developed event-driven pipelines using AWS Lambda, SageMaker, S3, and more.
Master's in Information Systems, Northeastern University, Boston
Bachelor's, VIT University, Vellore
United Nations Research: Analyzed migration patterns in Eastern Europe using advanced data science/statistics for public policy.
PyTorch, TensorFlow, HuggingFace, Scikit-learn, LangChain, OpenAI API
Python, R, SQL, Spark, Pandas, NumPy, Jupyter
AWS (Lambda, SageMaker), GCP (Vertex AI), Azure ML, Docker, Kubernetes
Git, CI/CD, REST APIs, FastAPI, Streamlit, MLOps, Model Deployment
Leveraging Vision Transformers for Early Detection of Oral Cancer: A Deep Learning Approach to Medical Imaging
Published in Intelligent and Fuzzy Systems, Artificial Intelligence in Human-Centric, Resilient and Sustainable Industries, Proceedings of the INFUS 2025 Conference, Volume 3
Read on Springer
Enhancing Animal Shelter Operations with Time Series and Machine Learning
Published in SMU Data Science Review Journal
Read on SMU Scholar
Leveraging GenAI for Biometric Voice Print Authentication
Published in SMU Data Science Review
Read on SMU Scholar
Location: Bentonville, USA
• Leading GenAI and data science integration for organizational transformation.
• Orchestrating multi-cloud deployment, auto-monitoring, and optimization of advanced AI/ML models.
• Applied LLMs (GPT-4, Llama, FlanT5) to projects in topic modeling, RAG, summarization, risk detection, NER, and prompt engineering.
• Driving innovation in AI-powered solutions for business impact.
Skills: GenAI, LLMs, Cloud AI, ML Engineering, Python, AWS, GCP, Azure, MLOps
• Built end-to-end ML models for supply chain optimization and analytics for Amazon Prime and Fresh.
• Developed event-driven pipelines using AWS Lambda, SageMaker, and S3.
• Led analytics infrastructure, tracked 100+ KPIs, and delivered insights in a fast-paced, startup-like environment.
• Focused on rapid prototyping, agile delivery, and cloud-based ML solutions.
Skills: Machine Learning, AWS, Data Engineering, Analytics, Python, Agile
• Cleaned textual data (around one million records) and performed data munging by transforming and mapping the different files provided by the IRS
• Performed text mining on financial data files using SAS and Python (NLTK and Pandas)
Skills: R, SAS, Python, NLTK, Pandas, Google Cloud
• Followed agile development methodology on a daily basis to perform data analysis and resolve support issues
• Created interactive MicroStrategy dashboards, reports, visual user manuals, and feature documentation for the developed software
• Performed data masking by encrypting Protected Health Information (PHI) in a SQL database using the AES_256 algorithm
• Worked on specimen transaction databases and developed a billing algorithm that may lead to revenue growth of about $10k
• Performed database migration, integration, and indexing from a legacy third-party database to an in-house SQL database
Skills: Data Analysis, Data Management, SQL, MicroStrategy
• Assisted the Republic of Moldova and the United Nations Development Programme (UNDP) in population estimation through household energy consumption data
• Examined spatial data using ArcGIS (a geographic information system) to analyze and understand its distribution patterns and to establish its relationship with factors such as economy, terrain, and geographic location
• Performed linear regression, multivariable regression, and other advanced statistical predictive modeling to derive key econometric indicators and estimate the shadow economy's share of the state's economy
• Performed clustering analysis to identify similarities between the growth patterns and economies of post-Soviet states
Skills: R, Python, Tableau, R Libraries (rpart, ggplot, Regression)
• Worked closely with the Finance and Banking business unit to gather business requirements and prioritize tasks
• Performed ETL to migrate data from multiple sources to the designed e-commerce data warehouses using SSIS
• Analyzed data and developed BI reports consisting of KPIs and scorecards using Tableau to improve corporate profitability measures and comparative marketing, sales, and financial analysis
• Designed and developed a database for a web app for the enterprise business unit
• Researched and analyzed the fields required for bank transactions, the banks' comprehensive data storage, and current practices and compliance requirements to improve the design and usage
• Created logical and physical database designs along with data structures, procedures, and triggers for a Hospital Management System using SQL Server
Skills: SQL, MySQL, SSIS, Talend, ETL, HTML, CSS, Bootstrap
• Assisted the firm in identifying baseline power consumption, engineering-based savings, and energy usage
• Applied logistic regression to analyze and predict household energy usage (over 6,000 household units) in Indore
• Conducted data preparation and outlier detection using MS SQL Server; built the model in R
Skills: R, SQL, Regression, Clustering
The coursework blends technical and management subjects, which has helped me develop a deeper understanding of information systems as a whole. With its focus on data analytics, machine learning, and visualization, it has given me modern skills for using technology to improve business processes and leverage data to make a business better.
Courses: Data Structures, C Programming