I am a Founding Member of Technical Staff at a stealth startup, working on post-training and distillation for language models and agents. I recently completed my MS in Machine Learning at Carnegie Mellon University (CMU), where I was a Research Scholar advised by Fernando De la Torre and collaborated closely with Raviteja Vemulapalli and Oncel Tuzel from Apple Machine Learning Research (MLR).
Before that, I was a Predoctoral Fellow at the Vision and AI Lab (VAL), Indian Institute of Science (IISc), where I was advised by R. Venkatesh Babu and worked with Sravanti Addepalli and Harsh Rangwani. I also collaborated with Prof. Anirban Chakraborty from the Visual Computing Lab (VCL), IISc. I completed my B.Tech in Computer Science and Engineering (2018–2022) at PES University, Bangalore.
Distillation. Frontier models are powerful but locked behind APIs: you can query them but not inspect or modify their weights. I'm interested in how much of their capability can be recovered through careful distillation, and which data regimes and query strategies make this transfer most effective.
Model diffing. As models evolve rapidly, it's hard to know what actually changed between versions beyond a few benchmark numbers. I want to develop better tools for comparing models: frameworks that surface meaningful behavioral differences and track compatibility across a model family over time.
Distribution shift. I've also worked on getting models to transfer reliably when the test distribution differs from the training distribution. This spans transformer-based approaches for source-free adaptation, vision-language supervision for cross-domain generalization, long-tail strategies for visual recognition, and federated methods for fine-tuning large models efficiently.
* Equal contribution