Available for opportunities

Kartikeya Agarwal


I build production ML systems that serve 10K+ RPS at <20ms, fine-tune LLMs that unlock $20M in yearly disbursals, and optimize infrastructure to save $250K+ annually.

01

About


Machine Learning Engineer with 3+ years of experience building production ML systems at Navi Technologies. I work across the full ML lifecycle, from fine-tuning LLMs for SMS entity recognition to developing credit risk models that improved approval rates by 150 bps.

I'm driven by optimizing systems and processes: reducing model training costs by 34%, saving over $250,000 yearly through AWS resource optimization, and architecting real-time feature serving at sub-20ms p99 latency. Currently focused on MLOps infrastructure and GenAI applications in financial services.

3+
Years Experience
2
Published Papers
$250K+
Yearly Savings
150 bps
Approval Uplift
Python · PyTorch · LLMs · XGBoost · CatBoost · FastAPI · MLflow · Kafka · PostgreSQL · AWS · Databricks · PySpark
02

Experience

Navi Technologies

Machine Learning Engineer · July 2023 – Present · Bangalore

Credit Risk Modeling

Improved the PrismV6 model by engineering new temporal features and replacing XGBoost retraining with a multi-modal pipeline, yielding a direct 150 bps increase in approval rate. Built income models with XGBoost and LightGBM, improving within-10% accuracy by 6 points.

+150 bps Approval Rate Increase
46% Retraining Time Reduction
34% Cost Reduction

Spade Real Time

Architected a family of three services that serve SMS, app, device, and location features in real time. Leveraged Reactive Kotlin with Postgres, EFS, S3, and Kafka. Implemented Trie-based template searching to cut runtimes by 70%.

<20ms P99 Latency
10K+ Requests Per Second
70% Runtime Cut (Trie)

Model Updation Service

Designed a comprehensive MLOps platform with a FastAPI backend, PostgreSQL, and MLflow integration for model versioning. Automated pipeline switching between QA and prod environments, with Jira and Slack notifications. Eliminated manual deployment effort.

Zero Manual Deployment
100% Automated Versioning

Cost Optimization & Infra

Built a Databricks access-control and cost-attribution framework with a resource monitor. Applied a regression framework to compute patterns to recommend optimizations and minimize wasted compute. Optimized feature-selection jobs (RFE/IV/Boruta) using Ray and tensors.

$250K+ Yearly Savings
80%+ Feature Selection Speedup

Automated Feature Store

Designed and built a fully automated feature store with metrics and alerts, ensuring 99.99% feature consistency across prod and dev. Rearchitected the pipeline consumed by the Data Science, Risk, and Analytics teams.

99.99% Feature Consistency
50%+ Faster Than Legacy
2x More Cost Efficient
03

Projects

01

Auto Code Sequence Generator

Attention model generating production-ready code from natural language and wireframe diagrams. Custom token vectors support multi-framework translation (React, Angular).

Attention · NLP · Code Gen · Transformers
02

Visual Aid for the Blind

IoT device built on an ESP32 and Arduino Nano that captures images over Wi-Fi and produces real-time scene descriptions using YOLOv3 and a Transformer model trained on MS COCO.

YOLOv3 · Transformers · ESP32 · IoT
03

AI Image Caption Bot

Image captioning model with 1.5M+ parameters combining LSTMs and CNNs, trained on Flickr30k for automatic description generation.

LSTM · CNN · Flickr30k · NLP
04

Research & Publications

Indian Journal of Computer Science

Early Detection of COVID-19 using Machine Learning

Used ResNet50 to extract features and distinguish COVID-19 lung X-rays from normal and pneumonia X-rays, achieving 99% accuracy on diagnostic imaging.

May – July 2021 · ResNet50 · Computer Vision
arXiv

Classification of Skin Cancer Images using CNNs

CNN-based model that classifies skin lesions as benign or malignant with 86%+ accuracy, using XceptionNet for segmentation and a custom dense network.

Apr – May 2021
05

Get in Touch

I'm always open to discussing ML engineering roles, interesting projects, or opportunities to collaborate. Let's build something remarkable.

kartikeya72001@gmail.com