Home Background
Ashutosh Kushwaha

Ashutosh Kushwaha

Data Scientist | M.Tech | IIT Bombay | 6.5+ Years Exp

About Me

A little about me,

Hello, I'm Ashutosh!

Data Scientist with 6.5+ years of experience building large-scale ML systems across fintech and ecommerce. Specialized in Uplift/propensity modeling, risk & fraud detection, and personalization/ranking.

I have a proven track record of taking models from EDA to production with measurable business impact (CTR, GMV, conversion). I possess a strong background in distributed systems, MLOps, and experimentation-driven development.

Regarding my education background, I did my Masters in Technology in Industrial Engineering and Operations Research (IEOR), at the prestigious Indian Institute of Technology (IIT) Bombay.

Technical Skills

PythonSQLPySparkDatabricksMLflowFastAPIDockerAWS (S3, EC2, SageMaker)AzurePostgreSQLGitMachine LearningPropensity ModelingA/B TestingDistributed Systems

Experience

Where I've worked

bash - ashutosh@portfolio ~
~ls -1
~cat tata-digital.md

Data Scientist - II

Date
June 2024 – Present
Loc
Bengaluru, India
-

Designed a multi-stage propensity modeling framework with funnel-aware feature engineering and stage-wise intent prediction for Insurance, Investments, and Credit Cards, enabling precision targeting for 50M+ users and improving funnel conversion by 19 BPS.

-

Operationalized weekly scoring pipelines for propensity models (stacked ensemble of XGBoost + DNN) and integrated outputs with the outbound CRM platform/call center to enable automated, timely targeting of high-intent users.

-

Led end-to-end development of a zone-aware trend discovery and product ranking system for Tata CLiQ merchandising, partnering with product and engineering to define requirements, productionize models, and run live A/B experiments, resulting in +0.34% carousel CTR uplift across regions.

-

Built a scalable, configurable training framework with automated large-scale feature selection (10k+ features), variance filtering, and multi-model voting-based feature subset selection.

-

Developed flexible Spark/AutoGluon training pipelines with integrated MLflow model registry support on Databricks, enabling automated packaging, versioning, and inference-ready deployment.

-

Architected and productionized a high-throughput Insurance Anti-Fraud service using FastAPI, achieving a P99 latency of 115ms for real-time screening.

~

Tech Stack

Tech I'm familiar with

Python
Python
PyTorch
PyTorch
Tensorflow
Tensorflow
Scikit-Learn
Scikit-Learn
Apache Spark
Apache Spark
Pandas
Pandas
Git
Git
AWS SageMaker
AWS SageMaker
Docker
Docker
FastAPI
FastAPI