Mansi Mane
I am working as a Staff Data Scientist at Walmart and my interests include Machine Learning applications in Computer Vision, and Natural Language Processing. I did masters from Carnegie Mellon University focused on Machine Learning.
Publications
Campaign-2-PT-RAG: LLM-Guided Semantic Product Type Attribution for Scalable Campaign Ranking
Yiming Che, Mansi Mane, Keerthi Gopalakrishnan, Parisa Kaghazgaran, Murali Mohana Krishna Dandu, Archana Venkatachalapathy, Sinduja Subramaniam, Yokila Arora, Evren Korpeoglu, Sushant Kumar, Kannan Achan
Accepted at LLM & Agents for Recommendation Systems (LARS), WWW 2026
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit Mane, Shashank Kedia, Aditya Mantha, Stephen Guo, Kannan Achan
Revised version accepted at The Web Conference (WWW) 2021
Complementary-Similarity Learning using Quadruplet Network (Code)
Mansi Ranjit Mane, Stephen Guo, Kannan Achan
Presented at Workshop on Recommender Systems in Fashion, ACM Recommender Systems (RecSys) 2019
Deep Learning based Head and Tail Localization of C. elegans (Code)
Mansi Ranjit Mane, Aniket Anand Deshmukh, Adam Iliff
Presented at ICML 2019 Workshop on Computational Biology
Work Experience
Staff Data Scientist, Walmart, Sunnyvale, USA
- Leading the design and development of multi-objective recommendation systems across home page, item page, and push notification surfaces, collaborating with engineering teams on system architecture and end-to-end deployment to drive engagement and conversion at scale.
- Developed LLM-powered RAG (Retrieval-Augmented Generation) pipelines for automated product attribute generation, improving catalog quality and enrichment.
- Lead recipe image generation initiative using diffusion models, working closely with engineering teams on system design and infrastructure to produce high-quality, photorealistic food imagery from recipe data.
Applied Scientist II, Amazon (AWS), Santa Clara, USA
- Trained billion scale parameter NLP model from scratch with minimal loss in accuracy. To enable customers to train such models seamlessly on AWS, worked on SageMaker Model Parallel and HuggingFace integration which is being used by 30% of distributed training customers
- Researched different batch size scaling algorithms which reduced training time by upto 3 times for ResNet training.
- Developed deep learning approaches like Siamese and triplet network in TensorFlow by using multi-modal attributes like image and text data for complementary item recommendations.
- Worked on making PyTorch available in deep learning containers which is being used by 10000 users per week.
- Built deep learning infrastructure using the following AWS services: S3, EC2, ECR, SageMaker, CloudFormation, CloudWatch, CodeBuild, IAM.
Data Scientist, Walmart Labs, Sunnyvale, USA
- Built machine learning models and data pipelines in Hive & PySpark for large scale item recommendations with 20 milion items.
- Deployed matrix factorization model for personalized item recommendations for 10M users which resulted in 0.08% gain in add to cart rate in online A/B test
- Developed siamese networks model for complementary item recommendations with in 0.1% gain in click through rate in online A/B test
- Developed machine translation model to generate product titles for 20,000 items sold with voice assistants
Research Assistant, CyLab Biometrics Center, CMU, Pittsburgh, USA
- Pre-processed data for face parsing using Fully Convolutional Instance Aware Semantic Segmentation.
- Segregated medical images containing nucleus, cells and not containing those for pap-smear test by applying techniques like Gaussian smoothening, threshoding etc.
- Synthesized 2D face images at different poses by modifying existing code to project 3D fitted morphable model for face into 2D.
Other Machine Learning Projects
Zero Shot Learning for Image Classification: (Project Report) (Code)
2D GANs with Capsule Networks, CMU: (Project Report) (Code)
Weakly Supervised Object Detection and Localization: (Code)
Fast Super-Resolution CNN (FSRCNN): (Presentation) (Code)