Sarthak Chauhan CodeNinjaSarthak

AI Engineer focused on production GenAI systems, reliable LLM infrastructure, retrieval systems, and robustness evaluation under real-world constraints.

About Me

class SarthakChauhan:

    role = [
        "AI Engineer",
        "ML Systems Builder",
        "Research Engineer"
    ]

    interests = [
        "Reliable LLM Systems",
        "Distributed Inference",
        "Retrieval-Augmented Generation",
        "Vision Robustness",
        "Temporal Memory Systems",
        "Evaluation Under Distribution Shift"
    ]

    currently_building = [
        "Production-scale GenAI infrastructure",
        "Long-context retrieval and reranking systems",
        "Low-latency async AI systems",
        "Reliable LLM evaluation pipelines"
    ]

What I Work On

⚡ Production AI Systems

Built production LLM systems serving 1000+ users
Reduced generation latency from 21s → 6s
Designed async orchestration using asyncio.gather
Built provider fallback routing: Azure → Claude / Gemini
Engineered Redis worker pipelines with bounded concurrency
Implemented SSE streaming, rate limiting, and circuit breakers

🔬 Research & Evaluation

3 IEEE publications (2 first-author)
Working on memory systems for temporal reasoning
Evaluating robustness under distribution shift
Benchmarking calibration across vision architectures
Researching retrieval quality and reranking systems
Building RL environments for AI safety evaluation

Featured Work

🚀 SafeAct-Env

AI Safety RL Environment Finalist — Meta × Scaler PyTorch OpenEnv Hackathon (Top 2.6%)

Multi-task RL environment for reversible vs irreversible actions
Deterministic graders with hidden risk classifier
164 passing tests with reproducible evaluation
Built across infra, filesystem, DB, and medical safety tasks

Stack: Python FastAPI Docker RL

🧠 Eidetic Memory

Memory System for Conversational AI

Achieved 56.3% LoCoMo QA
+39.3 pp temporal improvement over RAG baseline
Per-speaker memory isolation + neural reranking
Averaged only 1.9 LLM calls/query

Focus Areas: Temporal reasoning • retrieval • reranking • memory systems

Stack: FastAPI Qdrant Cross-Encoder LLMs

⚡ StreamMind

Real-Time Semantic Question Clustering

Reduced instructor response time by 68%
Designed fault-tolerant async processing pipeline
Handled 100+ concurrent doubts
Semantic deduplication using online clustering

Infra: Redis workers • pgvector • WebSockets • circuit breakers

Stack: FastAPI Redis pgvector Gemini

🏫 Medha AI

Production GenAI System @ Cograd

Serving curriculum-aligned generation workflows
Reduced lesson-plan latency 3.5×
Reduced exam generation latency 2.5×
Multi-provider orchestration with graceful degradation
Multi-HyDE retrieval + reranking pipeline

Infra: Async orchestration • Redis • Qdrant • Azure OpenAI

Stack: FastAPI Redis Qdrant MongoDB

Selected Research

Vision Robustness & Calibration

Evaluating 12 ImageNet-pretrained architectures across IN-Val, IN-V2, IN-R, IN-A, and IN-C using:

ECE
AURC
selective prediction
corruption robustness
universal failure analysis

Dense-Fog Highway Dehazing

Benchmarked 10 dehazing architectures and identified a 15–20 dB PSNR gap between synthetic benchmarks and real dense-fog highway conditions.

Hinglish Abuse Detection

Improved F1 from 0.784 → 0.866 on a 700K-post dataset using:

XLM-R transfer learning
BiGRU attention fusion
multilingual representation learning

Publications

📄 Hinglish Abusive Comment Detection Using Transformer-Based Models

AICAPS 2026 — IEEE Kerala Section First Author

📄 Image and Video Dehazing for Dense-Fog Indian Highway Scenarios

DICCT 2026 First Author

📄 Deep Learning-Based Brain Tumour Identification

IC3SE 2025 — IEEE UP Section Second Author

Tech Stack

Languages & ML

LLM & Retrieval

Systems & Infra

Achievements

🏆 Meta × Scaler PyTorch OpenEnv Hackathon — Finalist (Top 2.6%)
🏆 Amazon ML Challenge 2024 — Top 0.5%
🏆 IIT Bombay Convolve — Top 50 / 4189 teams
🎓 Dean’s List — Top 10%
📚 GPA: 9.42 / 10.0

GitHub Stats

Activity Graph

Connect

Building reliable AI systems, retrieval infrastructure, and evaluation pipelines.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sarthak Chauhan CodeNinjaSarthak

Achievements

Achievements

Block or report CodeNinjaSarthak

About Me

What I Work On

⚡ Production AI Systems

🔬 Research & Evaluation

Featured Work

🚀 SafeAct-Env

🧠 Eidetic Memory

⚡ StreamMind

🏫 Medha AI

Selected Research

Vision Robustness & Calibration

Dense-Fog Highway Dehazing

Hinglish Abuse Detection

Publications

📄 Hinglish Abusive Comment Detection Using Transformer-Based Models

📄 Image and Video Dehazing for Dense-Fog Indian Highway Scenarios

📄 Deep Learning-Based Brain Tumour Identification

Tech Stack

Languages & ML

LLM & Retrieval

Systems & Infra

Achievements

GitHub Stats

Activity Graph

Connect

Pinned Loading

Uh oh!