Fine-tuning – Anubhav Anand

Apr 15, 2026 · 5 min · finetuning · llm · deeplearning

Same GPU, same model, same LoRA config — and the run finishes in a third of the time using most of t

Apr 10, 2026 · 5 min · finetuning · llm · deeplearning

Nobody demos the data cleaning.

Apr 3, 2026 · 5 min · finetuning · llm · deeplearning · graph

You proved the task is solvable.

Mar 30, 2026 · 5 min · finetuning · llm · deeplearning

DPO answered the common case.

Mar 20, 2026 · 5 min · finetuning · llm · deeplearning

For a couple of years, teaching a model to prefer good answers over bad ones meant running three mod

Mar 12, 2026 · 4 min · finetuning · llm · deeplearning

Four methods, one question: when you sit down to fine-tune, which do you reach for?

Mar 7, 2026 · 5 min · finetuning · llm · deeplearning

Try to full-fine-tune an 8B model on a single 24 GB consumer card and you won't get to the first tra

Mar 3, 2026 · 5 min · finetuning · llm · deeplearning

A 7-billion-parameter model has 7 billion knobs.

Feb 23, 2026 · 5 min · finetuning · llm · deeplearning

Most fine-tuning projects should have stayed a prompt.