Show HN: Nallely – A Python signals/MIDI processing system inspired by Smalltalk (dr-schlange.github.io)

FYI it also supports pre-training, reward model training and RL, not just fine tuning (sft). My team built a managed solution for training that runs on top of llama factory and it's quite excellent and well supported. You will need pretty serious equipment to get good results out of it, think 8xh200. For people at home i would look at doing an sft of gemma3 270m or maybe a 1.6b qwen3, but keep in mind you have to have the dataset in memory as well as the model and kv-cache. cheers

metadat · 34m ago

This reminds me conceptually of the Nvidia NIM factory where they attempt to optimize models in bulk / en-masse.

https://www.nvidia.com/en-us/ai/nim-for-manufacturing/

Word on the street is the project has yielded largely unimpressive results compared to its potential, but NV is still investing in an attempt to further raise the GPU saturation waterline.

p.s. This project logo stood out to me at presenting the Llama releasing some "steam" with gusto. I wonder if that was intentional? Sorry for the immature take but stopping the scatological jokes is tough.

tensorlibb · 17m ago

This is incredible! What gpu configs, budget to ultra high-end, would you recommend for local fine tuning?

Always curious to see what other ai enthusiasts are running!

Twirrim · 1h ago

https://llamafactory.readthedocs.io/en/latest/

I found this link more useful.

"LLaMA Factory is an easy-to-use and efficient platform for training and fine-tuning large language models. With LLaMA Factory, you can fine-tune hundreds of pre-trained models locally without writing any code."

hall0ween · 49m ago

are there any use cases, aside from code generation and formatting, where fine-tuning consistently useful?

clipclopflop · 18m ago

Creating small, specialized models for specific tasks. Being able to leverage the up front training/data as a generalized base allows you to quickly create a small local model that can generate outputs for that task that can come close to or match the same you would see in a large/hosted model.

Apple: SSH and FileVault (keith.github.io)

The Sagrada Família Takes Its Final Shape (newyorker.com)

Nvidia buys $5B in Intel (tomshardware.com)

David Lynch LA House (wallpaper.com)

Want to piss off your IT department? Are the links not malicious looking enough? (phishyurl.com)

Learn Your Way: Reimagining Textbooks with Generative AI (research.google)

This map is not upside down (maps.com)

AI tools are making the world look weird (strat7.com)

Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs (github.com)

Rupert's snub cube and other Math Holes (tom7.org)

Meta’s live demo fails; “AI” recording plays before the actor takes the steps (reddit.com)

Show HN: Asxiv.org – Ask ArXiv papers questions through chat (asxiv.org)

Show HN: I created a small 2D game about an ant (aanthonymax.github.io)

Visual lexicon of consumer aesthetics from the 1970s until now (cari.institute)

Launch HN: Cactus (YC S25) – AI inference on smartphones (github.com)

Configuration files are user interfaces (ochagavia.nl)

Slack has raised our charges by $195k per year (skyfall.dev)

Tldraw SDK 4.0 (tldraw.dev)

TernFS – An exabyte scale, multi-region distributed filesystem (xtxmarkets.com)

KDE is now my favorite desktop (kokada.dev)

Flipper Zero Geiger Counter (kasiin.top)

Tracking Trust with Rust in the Kernel (lwn.net)

Luau – Fast, small, safe, gradually typed scripting language derived from Lua (luau.org)

Classic recessive-or-dominant gene dynamics may not be so simple (news.stanford.edu)

Show HN: Nallely – A Python signals/MIDI processing system inspired by Smalltalk (dr-schlange.github.io)

OpenTelemetry Collector: What It Is, When You Need It, and When You Don't (oneuptime.com)

When Knowing Someone at Meta Is the Only Way to Break Out of "Content Jail" (eff.org)

They Know More Than I Do (cybadger.com)

TIC-80 – Tiny Computer (tic80.com)

The quality of AI-assisted software depends on unit of work management (blog.nilenso.com)

Nvmath-Python: Nvidia Math Libraries for the Python Ecosystem (github.com)

PostgreSQL Maintenance Without Superuser (boringsql.com)

Aaron Levie: Startups win in the AI era [video] (youtube.com)

Midcentury North American Restaurant Placemats (casualarchivist.substack.com)

American Prairie unlocks another 70k acres in Montana (earthhope.substack.com)

Pnpm has a new setting to stave off supply chain attacks (pnpm.io)

ICE unit signs new $3M contract for phone-hacking tech (techcrunch.com)

OneDev – Self-hosted Git server with CI/CD, Kanban, and packages (onedev.io)

CircuitHub (YC W12) Is Hiring Operations Research Engineers (UK/Remote) (ycombinator.com)

This website has no class (aaadaaam.com)

I Built an Event-Sourcing Database Engine: Meet Genesis DB (genesisdb.io)

Dark patterns killed my wife's Windows 11 installation (osnews.com)

Grief gets an expiration date, just like us (bessstillman.substack.com)

Show HN: Dyad, local, open-source Lovable alternative (Electron desktop app) (dyad.sh)

U.S. already has the critical minerals it needs, according to new analysis (minesnewsroom.com)

Automatic differentiation can be incorrect (stochasticlifestyle.com)

Fast Fourier Transforms Part 1: Cooley-Tukey (connorboyle.io)

California electric vehicle drivers will lose carpool lane privileges (latimes.com)

The Day the Linter Broke My Code (blog.fillmore-labs.com)

WASM 3.0 Completed (webassembly.org)

Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs

Comments (6)