Fine-tuning · Education AI
AI Feynman Kannada Tutor
A multi-stage fine-tuning pipeline that builds a reasoning-first physics tutor in Kannada, combining SFT and RAG for intuitive, grounded explanations.
- Multi-stage SFT: language → domain → grounding
- LLM-as-judge evaluation on a 0–5 scale
- RAG with a physics knowledge base
- 4-model progression with measurable gains
- Dataset and models on HuggingFace
View case study →
Fine-tuning · Creative AI
AI Sitcom Scriptwriter
Teaching an open-source LLM to write The Office: reasoning-first screenplay generation with on-brand humor, character voice, and multi-step setups.
- SFT on reasoning traces + screenplay pairs
- Reinforcement fine-tuning (RFT) with PPO
- LLM-as-judge with 8 weighted metrics
- 3-model progression: Base → SFT → RFT
- Dataset and models on HuggingFace
View case study →
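For readers curious how an LLM-as-judge with weighted metrics might collapse into a single score, here is a minimal sketch. The metric names, the weights, and the 0–5 scale are all assumptions made for illustration; the listing above only says there are 8 weighted metrics and does not name them or give their weights.

```python
# Hypothetical metric names and weights (assumed, not from the case study).
# The weights are chosen to sum to 1.0, but the function normalizes anyway.
WEIGHTS = {
    "humor": 0.20,
    "character_voice": 0.20,
    "setup_payoff": 0.15,
    "format": 0.10,
    "coherence": 0.10,
    "originality": 0.10,
    "dialogue_flow": 0.10,
    "brand_fit": 0.05,
}


def weighted_score(scores, weights=WEIGHTS, max_score=5.0):
    """Combine per-metric judge scores into one weighted average.

    `scores` maps metric name -> judge score in [0, max_score].
    The 0..5 range is an assumption for this sketch.
    """
    missing = set(weights) - set(scores)
    if missing:
        raise ValueError(f"missing metrics: {sorted(missing)}")
    for name in weights:
        if not 0.0 <= scores[name] <= max_score:
            raise ValueError(f"{name} score out of range: {scores[name]}")
    total_weight = sum(weights.values())
    # Weighted mean: dividing by the total keeps the result on the judge's scale.
    return sum(weights[m] * scores[m] for m in weights) / total_weight
```

Comparing the mean of `weighted_score` over a fixed prompt set for each checkpoint (Base, SFT, RFT) would be one way to express a model progression as a single number per model.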