Mansi Mane

Mansi Mane photo

Email:mansimane(FIVE)(AT)gmail(DOT)com, CV , GitHub , LinkedIn

I am working as a Staff Machine Learning Scientist at Walmart and my interests include Machine Learning applications in Recommender Systems, Natural Language Processing and Computer Vision. Previously, I was working on LLM billion scale pre-training as Applied Scientist at Amazon. I did masters from Carnegie Mellon University focused on Machine Learning.

Work Experience

  • Staff Machine Learning Scientist, Walmart, Sunnyvale, USA

    • Multi-objective recommendation systems across home page and item page. Achieved 34bps CTR lift and 11bps ATCs lift across 100M+ customers.
    • Personalized Push notification recommendation system. Drove $17M+ GMV per week.
    • LLM-powered campaign to product type mapping via RAG for 6K product types, reducing 2 weeks manual effort to an hour. 0.74 hit rate.
    • Recipe generation pipeline using GPT and diffusion models, 71% title and 14% cooking step accuracy.
  • Applied Scientist II, Amazon (AWS), Santa Clara, USA

    • Trained billion scale parameter NLP model from scratch with minimal loss in accuracy. To enable customers to train such models seamlessly on AWS, worked on SageMaker Model Parallel and HuggingFace integration which is being used by 30% of distributed training customers
    • Researched different batch size scaling algorithms which reduced training time by upto 3 times for ResNet training.
    • Developed deep learning approaches like Siamese and triplet network in TensorFlow by using multi-modal attributes like image and text data for complementary item recommendations.
    • Worked on making PyTorch available in deep learning containers which is being used by 10000 users per week.
    • Built deep learning infrastructure using the following AWS services: S3, EC2, ECR, SageMaker, CloudFormation, CloudWatch, CodeBuild, IAM.
  • Machine Learning Scientist, Walmart Labs, Sunnyvale, USA

    • Deployed personalized item recommendation model (matrix factorization) serving 10M users, 8bps add-to-cart lift.
    • Siamese network for complementary recommendations, 15bps CTR lift.
    • Sequence-to-sequence machine translation for 20K product titles for voice assistants.
  • Research Internship, CyLab Biometrics Center, CMU, Pittsburgh, USA

    • Weakly supervised object detection using AlexNet in PyTorch; 0.17mAP on PASCAL VOC.
    • Pre-processed data for face parsing using Fully Convolutional Instance Aware Semantic Segmentation.
    • Segregated medical images containing nucleus, cells and not containing those for pap-smear test by applying techniques like Gaussian smoothening, threshoding etc.
    • Synthesized 2D face images at different poses by modifying existing code to project 3D fitted morphable model for face into 2D.

Publications

Workshop Organization

Other Machine Learning Projects