Kartikeya Agarwal

Machine Learning Engineer with 3+ years of experience building production ML systems at Navi Technologies. I work across the full ML lifecycle — from fine-tuning LLMs for SMS entity recognition to developing credit risk models that improved approval rates by 150bps.

I'm driven by optimizing systems and processes: reducing model training costs by 34%, saving over $250,000 yearly through AWS resource optimization, and architecting real-time feature serving at sub-20ms p99 latency. Currently focused on MLOps infrastructure and GenAI applications in financial services.

0+

Years Experience

0

Published Papers

$0K+

Yearly Savings

0bps

Approval Uplift

Python PyTorch LLMs XGBoost CatBoost FastAPI MLflow Kafka PostgreSQL AWS Databricks PySpark

Navi Technologies

Machine Learning Engineer · July 2023 – Present · Bangalore

Featured Work

GenAI & LLM Engineering

LLaMA 3Mistral 7BQLoRABiLSTM-CRF

LLM Fine-Tuning for SMS Entity Recognition

Proposed and implemented parameter-efficient fine-tuning of Mistral 7B and LLaMA 3 8B using QLoRA for SMS entity recognition. Distilled into BiLSTM-CRF with BIO tagging to solve production latency constraints. Eliminated manual tagging dependency and reduced turnaround by 36 hours.

Multi-Agent LLM Underwriting Engine

Built an end-to-end guided re-verification platform for rejected loan applicants. Integrated in-house ITR & GST parser (IGNIS) with a multi-agent LLM engine and multi-lingual chatbot. Orchestrated a dynamic 3–5 model ensemble generating 150+ features.

+36 bps Approval Rate

$20Mil Yearly Disbursal Uplift

36h TAT Reduction

10–15L/mo Cost Savings

Credit Risk Modeling

Improved PrismV6 model by engineering new temporal features and replacing XGBoost retraining with a multi-modal pipeline for a direct 150bps approval rate increase. Built income models using XGBoost + LightGBM improving accuracy within 10% by 6 points.

+150 bps Approval Rate Increase

46% Retraining Time Reduction

34% Cost Reduction

Spade Real Time

Architected a family of 3 services serving SMS, app, device and location features in real-time. Leveraged Reactive Kotlin with Postgres, EFS, S3, and Kafka. Implemented Trie-based template searching to cut runtimes by 70%.

<20ms P99 Latency

10K+ Requests Per Second

70% Runtime Cut (Trie)

Model Updation Service

Designed a comprehensive MLOps platform with FastAPI backend, PostgreSQL, MLflow integration for model versioning. Automated pipeline switching between QA and prod environments with JIRA & Slack notifications. Completely removed manual deployment effort.

Zero Manual Deployment

100% Automated Versioning

Cost Optimization & Infra

Built Databricks access control and cost attribution framework with a resource monitor. Used a regression framework on compute patterns to recommend optimizations, minimizing wasted compute. Optimized feature-selection jobs (RFE/IV/Boruta) using Ray and tensors.

$250K+ Yearly Savings

80%+ Feature Selection Speedup

Automated Feature Store

Designed and built an entirely automated feature store with metrics and alerts, ensuring 99.99% feature consistency across prod and dev. Rearchitected the pipeline consumed by Data Science, Risk, and Analytics teams.

99.99% Feature Consistency

50%+ Faster Than Legacy

2x More Cost Efficient

01

Auto Code Sequence Generator

Attention model generating production-ready code from natural language and wireframe diagrams. Custom token vectors support multi-framework translation (React, Angular).

AttentionNLPCode GenTransformers

02

Visual Aid for the Blind

IoT device using ESP32 + Arduino Nano capturing images over WiFi with real-time scene descriptions via YOLOv3 and Transformer model, trained on MS COCO.

YOLOv3TransformersESP32IoT

03

AI Image Caption Bot

Image captioning model with 1.5M+ parameters combining LSTMs and CNNs, trained on Flickr30k for automatic description generation.

LSTMCNNFlickr30kNLP

Indian Journal of Computer Science

Early Detection of COVID-19 using Machine Learning

Utilized ResNet50 to extract features and distinguish COVID-19 from normal lung X-rays and pneumonia, achieving 99% accuracy on diagnostic imaging.

May – July 2021 ResNet50 · Computer Vision

arXiv

Classification of Skin Cancer Images using CNNs

CNN-based model to classify skin lesions into Benign and Malignant with 86%+ accuracy using XceptionNet for segmentation and custom Dense Network.

Apr – May 2021 Read on arXiv

I'm always open to discussing ML engineering roles, interesting projects, or opportunities to collaborate. Let's build something remarkable.

kartikeya72001@gmail.com

About

Experience