Show HN: DistilKitPlus, a framework for distillation between any LLMs
9 points by ayushnangia16 | 4 comments | 5/5/2025, 4:12:05 PM | github.com
Over the past few months, I have built a distillation toolkit that supports cross-tokenizer distillation (e.g., distilling a LLaMA teacher into a student with the Qwen vocabulary, or other tokenizer pairs). This approach has worked well on reasoning datasets like AIME, and we've validated it on models like Phi and Qwen.
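For anyone wondering what cross-tokenizer distillation involves mechanically: since teacher and student vocabularies differ, logits can't be matched index-by-index, so one common workaround is to compare the sorted probability distributions at each step. Below is a minimal sketch of that idea; it's an illustrative assumption, not necessarily the exact loss DistilKitPlus implements.

    # Sketch of a cross-tokenizer KD loss: compare sorted probability
    # distributions so vocabularies of different sizes can be matched.
    # Illustrative only -- not claimed to be the repo's actual loss.
    import torch
    import torch.nn.functional as F

    def sorted_prob_kd_loss(student_logits, teacher_logits):
        """student_logits: [B, T, Vs], teacher_logits: [B, T, Vt].

        Assumes the two sequences were already aligned to the same
        length T (e.g., truncated to the shorter tokenization)."""
        s = F.softmax(student_logits, dim=-1).sort(dim=-1, descending=True).values
        t = F.softmax(teacher_logits, dim=-1).sort(dim=-1, descending=True).values
        # Pad the smaller vocabulary with zeros so the sorted vectors line up.
        v = max(s.size(-1), t.size(-1))
        s = F.pad(s, (0, v - s.size(-1)))
        t = F.pad(t, (0, v - t.size(-1)))
        # L1 distance between sorted distributions, averaged over batch and time.
        return (s - t).abs().sum(dim=-1).mean()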
We’ve also integrated Modal for quick deployment (with $30/month credits to try it out).
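For a sense of what the Modal path might look like, here's a rough sketch of a remote distillation job. The app name, image packages, config path, and run_distillation entrypoint are placeholders I made up, not the repo's actual API:

    # Hypothetical Modal deployment sketch -- names/config are placeholders.
    import modal

    app = modal.App("distilkitplus-distill")
    image = modal.Image.debian_slim().pip_install("torch", "transformers", "datasets")

    @app.function(gpu="A100", image=image, timeout=6 * 60 * 60)
    def run_distillation(config_path: str = "configs/llama_to_qwen.yaml"):
        # Inside the container: load teacher/student, run the KD loop, save the student.
        ...

    # Launch from your machine with:  modal run distill_job.py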
Would love any feedback!
Comments (4)
vikramxD · 2h ago
Cool, are you accepting contributions for adding new models?
vijit-singh · 3h ago
This is very cool. Will try it out.
shikharM07 · 5h ago
This is kinda interesting, but I'm curious: what is the smallest model size I can distill to without compromising accuracy?
agokrani · 4h ago
We can distill a 14B model down to a 4B model with performance improvements on AIME24 and GSM8K. We will share our results in a detailed blog post later.