Available for opportunities

Kartikeya Agarwal

|

I build production ML systems that serve 10K+ RPS at <25ms, fine-tune LLMs that unlock INR 15Cr monthly, and optimize infrastructure saving $18K/month.

01

About

Kartikeya Agarwal

Machine Learning Engineer with 3+ years of experience building production ML systems at Navi Technologies. I work across the full ML lifecycle — from fine-tuning LLMs for SMS entity recognition to developing credit risk models that improved approval rates by 150bps.

I'm driven by optimizing systems and processes: reducing model training costs by 31%, saving up to $18,000 monthly through AWS resource optimization, and architecting real-time feature serving at sub-25ms p99 latency. Currently focused on MLOps infrastructure and GenAI applications in financial services.

0+
Years Experience
0
Published Papers
$0K+
Infra Savings
0bps
Approval Uplift
Python PyTorch LLMs XGBoost CatBoost FastAPI MLflow Kafka PostgreSQL AWS Databricks PySpark
02

Experience

Navi Technologies

Machine Learning Engineer · July 2023 – Present · Bangalore

Credit Risk Modeling

Replaced XGBoost with CatBoost for PrismV6 risk model, engineering new features for a direct 150bps approval rate increase. Built income models using XGBoost + LightGBM improving accuracy within 10% by 6 points.

+150 bps Approval Rate Increase
46% Retraining Time Reduction
31% Cost Reduction

Spade Real Time

Architected a family of 4 services serving SMS, app, device and location features in real-time. Leveraged Reactive Kotlin with Postgres, Scylla, EFS, S3, and Kafka. Implemented Trie-based template searching to cut runtimes by 70%.

<25ms P99 Latency
10K+ Requests Per Second
70% Runtime Cut (Trie)

Model Deployment Platform

Designed a comprehensive MLOps platform with FastAPI backend, PostgreSQL, MLflow integration for model versioning. Automated pipeline switching between prod environments with JIRA & Slack notifications.

60% Less Manual Effort
99.99% Feature Consistency

Cost Optimization & Infra

Built Databricks access control framework and AWS resource usage regulator. Analyzed compute patterns and built a model to recommend resource reductions. Optimized feature-selection jobs (RFE/IV/Boruta) using Ray and tensors.

$18K/mo AWS Savings
50%+ Feature Store Speedup
03

Projects

01

Auto Code Sequence Generator

Attention model generating production-ready code from natural language and wireframe diagrams. Custom token vectors support multi-framework translation (React, Angular).

AttentionNLPCode GenTransformers
02

Visual Aid for the Blind

IoT device using ESP32 + Arduino Nano capturing images over WiFi with real-time scene descriptions via YOLOv3 and Transformer model, trained on MS COCO.

YOLOv3TransformersESP32IoT
03

AI Image Caption Bot

Image captioning model with 1.5M+ parameters combining LSTMs and CNNs, trained on Flickr30k for automatic description generation.

LSTMCNNFlickr30kNLP
04

Research & Publications

Indian Journal of Computer Science

Early Detection of COVID-19 using Machine Learning

Utilized ResNet50 to extract features and distinguish COVID-19 from normal lung X-rays and pneumonia, achieving 99% accuracy on diagnostic imaging.

May – July 2021 ResNet50 · Computer Vision
arXiv

Classification of Skin Cancer Images using CNNs

CNN-based model to classify skin lesions into Benign and Malignant with 86%+ accuracy using XceptionNet for segmentation and custom Dense Network.

Apr – May 2021 Read on arXiv
05

Get in Touch

I'm always open to discussing ML engineering roles, interesting projects, or opportunities to collaborate. Let's build something remarkable.

kartikeya72001@gmail.com
gradient_descent.py