Show HN: Fine-tuned Llama 3.2 3B to match 70B models for local transcripts
I wanted to process raw transcripts locally instead of sending them through OpenRouter. Llama 3.2 3B with just a prompt was decent but incomplete, so I tried supervised fine-tuning (SFT): teaching it to clean/analyze dictation and emit structured JSON (title, tags, entities, dates, actions).
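Roughly the output shape (field names as above; the values here are made up for illustration, the exact schema lives in the repo):

    # illustrative only: field names from the post, values invented
    example_output = {
        "title": "Call plumber about kitchen leak",
        "tags": ["home", "maintenance"],
        "entities": ["kitchen sink", "plumber"],
        "dates": ["2025-01-14"],
        "actions": ["call plumber", "get a quote"],
    }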
Data: 13 real memos → Kimi K2 gold JSON → ~40k synthetic examples plus the gold set, with keys canonicalized to one schema. Synthetic generation ran through Chutes.ai (5k req/day).
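By "canonicalized" I mean every training target uses the same key names in the same order, so the model never sees two spellings of the same field. A sketch (the alias table is hypothetical; the real pipeline is in the repo):

    import json

    CANONICAL_KEYS = ["title", "tags", "entities", "dates", "actions"]
    ALIASES = {"tag": "tags", "entity": "entities", "date": "dates",
               "action_items": "actions", "todos": "actions"}  # hypothetical aliases

    def canonicalize(raw: str) -> str:
        obj = json.loads(raw)
        renamed = {ALIASES.get(k.strip().lower(), k.strip().lower()): v
                   for k, v in obj.items()}
        # fixed key order so every target string has identical structure
        ordered = {k: renamed.get(k, "" if k == "title" else [])
                   for k in CANONICAL_KEYS}
        return json.dumps(ordered, ensure_ascii=False)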
Training: RTX 4090 24GB, ~4h with Unsloth; LoRA (r=128, α=128, dropout=0.05), max seq len 2048, batch size 16, lr 5e-5, cosine schedule. The same run took ~8h on a 2070 Super 8GB.
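The core of the training script looks something like this (Unsloth + trl, classic notebook-style API; the base checkpoint id, dataset path, and epoch count are assumptions, not from the post):

    from unsloth import FastLanguageModel
    from datasets import load_dataset
    from trl import SFTTrainer
    from transformers import TrainingArguments

    model, tokenizer = FastLanguageModel.from_pretrained(
        "unsloth/Llama-3.2-3B-Instruct",   # assumed base checkpoint
        max_seq_length=2048,
        load_in_4bit=True,
    )
    model = FastLanguageModel.get_peft_model(
        model,
        r=128, lora_alpha=128, lora_dropout=0.05,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                        "gate_proj", "up_proj", "down_proj"],
    )
    dataset = load_dataset("json", data_files="train.jsonl", split="train")  # placeholder path
    trainer = SFTTrainer(
        model=model,
        tokenizer=tokenizer,
        train_dataset=dataset,
        dataset_text_field="text",   # chat-templated prompt + JSON target per row
        max_seq_length=2048,
        args=TrainingArguments(
            per_device_train_batch_size=16,
            learning_rate=5e-5,
            lr_scheduler_type="cosine",
            num_train_epochs=1,      # assumption; the post doesn't say
            output_dir="outputs",
        ),
    )
    trainer.train()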
Inference: merged the LoRA into the base weights, exported to GGUF at Q4_K_M via llama.cpp; runs in LM Studio.
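One way to do the merge + quantize, via Unsloth's wrapper around llama.cpp's convert/quantize scripts (directory name is a placeholder):

    # merge LoRA into the base weights and emit a Q4_K_M .gguf
    model.save_pretrained_gguf("notes-3b-gguf", tokenizer,
                               quantization_method="q4_k_m")

The resulting .gguf loads directly in LM Studio.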
Evals (100 samples, GLM 4.5 FP8 as judge): overall 5.35 (base 3B) → 8.55 (fine-tuned); completeness 4.12 → 7.62; factual accuracy 5.24 → 8.57.
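The judging setup, sketched (the rubric wording, model id, and endpoint below are placeholders; only "GLM 4.5 FP8, 100 samples, overall/completeness/factual" reflects the actual eval):

    import json
    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")  # any OpenAI-compatible server

    RUBRIC = ('Score this JSON extraction of the transcript from 1-10 on overall '
              'quality, completeness, and factual accuracy. Reply with JSON only: '
              '{"overall": n, "completeness": n, "factual": n}')

    def judge(transcript: str, model_output: str) -> dict:
        resp = client.chat.completions.create(
            model="glm-4.5",  # placeholder model id
            temperature=0,
            messages=[{"role": "user", "content":
                       f"{RUBRIC}\n\nTRANSCRIPT:\n{transcript}\n\nOUTPUT:\n{model_output}"}],
        )
        return json.loads(resp.choices[0].message.content)  # assumes the judge returns bare JSON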
Head-to-head on 10 samples: fine-tuned 3B ~8.40 vs Hermes-70B 8.18, Mistral-Small-24B 7.90, Gemma-3-12B 7.76, Qwen3-14B 7.62. The teacher, Kimi K2, scored ~8.82.
Why it works: task specialization plus JSON canonicalization reduce variance; the model learns the exact structure and fields to emit.
Lessons: train on completions only (masking sketch below); synthetic data is fine for narrow tasks; Llama is straightforward to fine-tune. Dataset pipeline + training script + evals: https://github.com/bilawalriaz/local-notes-transcribe-llm
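"Completions only" means the prompt tokens are masked out of the loss, so the model learns to produce the JSON rather than re-predict the transcript. With Unsloth that's one helper applied to the trainer from the training sketch above (the header strings are Llama 3 chat-template markers):

    from unsloth.chat_templates import train_on_responses_only

    trainer = train_on_responses_only(
        trainer,  # the SFTTrainer built earlier
        instruction_part="<|start_header_id|>user<|end_header_id|>\n\n",
        response_part="<|start_header_id|>assistant<|end_header_id|>\n\n",
    )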