Unsolved Problems in MLOps (spawn-queue.acm.org)

1 points by jamesblonde 53s ago 0 comments

Ultra Ethernet's Design Principles and Architectural Innovations (arxiv.org)

1 points by tanelpoder 1m ago 0 comments

After Afghan Quake, Many Male Rescuers Helped Men but Not Women (nytimes.com)

1 points by 7402 2m ago 0 comments

MyAI101: Foundational AI literacy for students, teachers and curious adults (myai101.com)

1 points by yzh 2m ago 1 comments

Azelastine Nasal Spray for Prevention of SARS-CoV-2 Infections (jamanetwork.com)

1 points by notmyjob 4m ago 0 comments

Submit your ideas for Interop 2026 (webkit.org)

1 points by ksec 5m ago 0 comments

A Nighttime Raid (nytimes.com)

1 points by jbegley 7m ago 0 comments

Pig lung transplanted into brain-dead person for 9 days (livescience.com)

1 points by gmays 7m ago 0 comments

AI robots can carve stone statues. buildings are next (fastcompany.com)

1 points by warrenm 8m ago 0 comments

Digital Terraforming (digitalterraforming.com)

1 points by thenthenthen 8m ago 0 comments

Tech 'I'm glad it's over.' Google CEO thanks Trump for antitrust 'resolution' (cnbc.com)

1 points by 01-_- 8m ago 0 comments

Astronomers Discover Molecular Cloud Hidden in Milky Way (greenbankobservatory.org)

2 points by 01-_- 9m ago 0 comments

The "Brain Juice" Method for Speechwriting in the Age of AI (chiefwordofficer.substack.com)

2 points by itoshinoeri 9m ago 0 comments

Bullets with butterfly wings: nature conservation and the military (shrubstack.substack.com)

1 points by thinkingemote 10m ago 0 comments

Inside My Study of the Oldest Companies (bigthink.com)

1 points by warrenm 10m ago 0 comments

Do you know any design first engineering blogs?

1 points by ShaggyHotDog 10m ago 0 comments

A Defiant Kennedy Defends Vaccine Changes and CDC Shake-Up (nytimes.com)

2 points by chirau 10m ago 0 comments

They're Making Dutch Golden Age Art in Minecraft (pushtotalk.gg)

1 points by speckx 11m ago 0 comments

What I learned managing an AI developer while seeking enlightenment (pocha.substack.com)

2 points by suninsight 13m ago 0 comments

YouTube Continues Targeting Poker Content Creators (pokernews.com)

1 points by indigodaddy 13m ago 0 comments

Alma Reveals an Eccentricity Gradient in the Fomalhaut Debris Disk (iopscience.iop.org)

1 points by PaulHoule 13m ago 0 comments

Parameters and binding forms should be mutually recursive (samestep.com)

1 points by sestep 13m ago 0 comments

Specificity Calculator: A visual way to understand CSS specificity (specificity.keegan.st)

1 points by eustoria 16m ago 0 comments

Free Nano Banana Prompt for AI Image Editing (nanoprompt.net)

1 points by MintNow 17m ago 0 comments

Choosing TypeScript Typing Patterns to Reduce Tech Debt (jsdev.space)

1 points by eustoria 17m ago 0 comments

Video Game Blurs (blog.frost.kiwi)

1 points by FrostKiwi 18m ago 0 comments

Spec-driven development with AI: Get started with a new open-source toolkit (github.blog)

1 points by WolfOliver 19m ago 0 comments

Why Gen X is the real loser generation (economist.com)

1 points by throw0101c 20m ago 1 comments

Baseten raises $150M Series D at $2.15B (fortune.com)

1 points by philipkiely 21m ago 0 comments

Real-world epilepsy monitoring with subcutaneous electroencephalography (onlinelibrary.wiley.com)

1 points by PaulHoule 22m ago 0 comments

One of your best defenses against phishing emails is still hovering over links (ghacks.net)

1 points by speckx 23m ago 0 comments

Comparison of the Effects of Stirring and Standing on Chemical Reactions (thieme-connect.de)

1 points by bookofjoe 23m ago 0 comments

Bybit Wallet Hacked (me14solutionsltd.org)

2 points by ytftyu 24m ago 1 comments

I'm curating my own tech newsletter to keep myself stay ahead (chilldog-news-feed.lovable.app)

1 points by bingwu1995 24m ago 0 comments

A computer upgrade has shut down BART (bart.gov)

4 points by ksajadi 26m ago 0 comments

Do you like playing wordle games?Here are the best Wordle Solver tools to use (wordlesolver.best)

1 points by bitvvip 26m ago 0 comments

The Bible-inspired game soundtrack that's giving people goosebumps [video] (youtube.com)

1 points by andygeers 27m ago 0 comments

How big are our embeddings now and why? (vickiboykis.com)

2 points by surprisetalk 31m ago 0 comments

Thioester-mediated RNA aminoacylation and peptidyl-RNA synthesis in water (nature.com)

1 points by surprisetalk 31m ago 0 comments

Phone for Kids Reinvented: Tin Can Brings Back the Landline (seattleschild.com)

2 points by surprisetalk 31m ago 0 comments

Visual Development Is Fast Until Big-O Complexity Slows It Down. Here's Our Fix (jinen83.github.io)

2 points by jinen83 33m ago 0 comments

Cookies placed without consent: SHEIN fined 150M euros by the CNIL (cnil.fr)

4 points by robin_reala 34m ago 0 comments

WTF reshaped stand-up, podcasting, and a whole lot more (slate.com)

1 points by colinprince 34m ago 0 comments

Artificial connections: Romantic relationship engagement with AI in the US (journals.sagepub.com)

2 points by cainxinth 35m ago 0 comments

USGS Unveils New National Geologic Map (usgs.gov)

4 points by speckx 35m ago 2 comments

Why Lean 4 replaced OCaml as my primary language (kirancodes.me)

3 points by fanf2 37m ago 0 comments

OpenAI links up with Broadcom to produce its own AI chips (arstechnica.com)

4 points by ukuina 37m ago 2 comments

Just One More Prompt (commandpattern.org)

1 points by robmurrer 40m ago 1 comments

Why Browser Company at $610M is cheap (bigtechpr.substack.com)

13 points by meshugaas 41m ago 17 comments

sqlalchemy check constraints and operator precedence (blog.kobaltlabs.com)

1 points by ashia 42m ago 0 comments

Show HN: Shimmy – 5MB privacy-first, local alternative to Ollama (680MB)

12 MKuykendall 9 9/4/2025, 6:10:12 PM github.com ↗

Comments (9)

MKuykendall · 21h ago

Hey HN! I built this because I was tired of waiting 10 seconds for Ollama's 680MB binary to start just to run a 4GB model locally.

Quick demo - working VSCode + local AI in 30 seconds: curl -L https://github.com/Michael-A-Kuykendall/shimmy/releases/late... ./shimmy serve # Point VSCode/Cursor to localhost:11435

The technical achievement: Got it down to 5.1MB by stripping everything except pure inference. Written in Rust, uses llama.cpp's engine.

One feature I'm excited about: You can use LoRA adapters directly without converting them. Just point to your .gguf base model and .gguf LoRA - it handles the merge at runtime. Makes iterating on fine-tuned models much faster since there's no conversion step.

Your data never leaves your machine. No telemetry. No accounts. Just a tiny binary that makes GGUF models work with your AI coding tools.

Would love feedback on the auto-discovery feature - it finds your models automatically so you don't need any configuration.

What's your local LLM setup? Are you using LoRA adapters for anything specific?

carlos_rpn · 20h ago

You may have noticed already, but the link to the binary is throwing a 404.

MKuykendall · 19h ago

This should be fixed now!

stupidgeek314 · 12h ago

Windows Defender tripped this for me, calling it out as Bearfoos trojan. Most likely a false positive, but jfyi.

MKuykendall · 2h ago

Try cargo install or intentionally exclude, unsigned Rust binaries will do this.

cat-turner · 4h ago

looks cool, ty! really great project will try this out.

homarp · 21h ago

Nice, a rust tool wrapping llama.cpp

how does it differ from llama-server?

and from llama-swap?

MKuykendall · 20h ago

Shimmy is designed to be "invisible infrastructure" - the simplest possible way to get local inference working with your existing AI tools. llama-server gives you more control, llama-swap gives you multi-model management.

  Key differences:
  - Architecture: llama-swap = proxy + multiple servers, Shimmy = single server
  - Resource usage: llama-swap runs multiple processes, Shimmy = one 50MB process
  - Use case: llama-swap for managing many models, Shimmy for simplicity

MKuykendall · 20h ago

Shimmy is for when you want the absolute minimum footprint - CI/CD pipelines, quick local testing, or systems where you can't install 680MB of dependencies.