Ashutosh Kushwaha

Data Scientist | M.Tech | IIT Bombay | 6.5+ Years Exp

About Me

A little about me,

Hello, I'm Ashutosh!

Data Scientist with 6.5+ years of experience building large-scale ML systems across fintech and ecommerce. Specialized in Uplift/propensity modeling, risk & fraud detection, and personalization/ranking.

I have a proven track record of taking models from EDA to production with measurable business impact (CTR, GMV, conversion). I possess a strong background in distributed systems, MLOps, and experimentation-driven development.

Regarding my education background, I did my Masters in Technology in Industrial Engineering and Operations Research (IEOR), at the prestigious Indian Institute of Technology (IIT) Bombay.

Technical Skills

PythonSQLPySparkDatabricksMLflowFastAPIDockerAWS (S3, EC2, SageMaker)AzurePostgreSQLGitMachine LearningPropensity ModelingA/B TestingDistributed Systems

Experience

Where I've worked

bash - ashutosh@portfolio ~

➜~/experience~ls -1

➜~/experience~cat tata-digital.md

Data Scientist - II

@Tata Digital (Tata Neu)

Date
June 2024 – PresentLoc
Bengaluru, India

Designed a multi-stage propensity modeling framework with funnel-aware feature engineering and stage-wise intent prediction for Insurance, Investments, and Credit Cards, enabling precision targeting for 50M+ users and improving funnel conversion by 19 BPS.

Operationalized weekly scoring pipelines for propensity models (stacked ensemble of XGBoost + DNN) and integrated outputs with the outbound CRM platform/call center to enable automated, timely targeting of high-intent users.

Led end-to-end development of a zone-aware trend discovery and product ranking system for Tata CLiQ merchandising, partnering with product and engineering to define requirements, productionize models, and run live A/B experiments, resulting in +0.34% carousel CTR uplift across regions.

Built a scalable, configurable training framework with automated large-scale feature selection (10k+ features), variance filtering, and multi-model voting-based feature subset selection.

Developed flexible Spark/AutoGluon training pipelines with integrated MLflow model registry support on Databricks, enabling automated packaging, versioning, and inference-ready deployment.

Architected and productionized a high-throughput Insurance Anti-Fraud service using FastAPI, achieving a P99 latency of 115ms for real-time screening.

➜~/experience~

Tech Stack

Tech I'm familiar with

Python

PyTorch

Tensorflow

Scikit-Learn

Apache Spark

Pandas

Git

AWS SageMaker

Docker

FastAPI