Show HN: LoFT CLI – Fine-tune and run LLMs (1–3B) on an 8 GB MacBook Air, no GPU
I built *LoFT*, a lightweight CLI that turns any 8 GB laptop into a tiny LLM training and inference rig — no GPU, no cloud.
5 commands:
1. `loft finetune` → Train LoRA adapters on CPU
2. `loft merge` → Merge adapters into the model
3. `loft export` → Convert to GGUF (FP16)
4. `loft quantize` → Apply Q4_0 (4-bit) quantization
5. `loft chat` → llama.cpp CPU chat @ ~7 tok/s
Benchmarks on an 8 GB MacBook Air:

| Step      | Time / Speed        | Peak RAM |
|-----------|---------------------|----------|
| Finetune  | 23 min (sample run) | 308 MB   |
| Merge     | 4.7 min             | 322 MB   |
| Quantize  | 21 sec              | 322 MB   |
| Inference | 6.9 tok/s           | 322 MB   |
I also ran a full 300-row Dolly finetune (2 epochs) in *~1.5 hours*, reaching *sub-1 training loss* on a CPU-only setup. No crashes, no swap kills, no GPU needed.
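For context, here is a minimal sketch of what a CPU-only LoRA pass over a small Dolly slice can look like with Hugging Face transformers + peft. This illustrates the general technique, not LoFT's actual internals; the base model, hyperparameters, and prompt format are assumptions:

```python
# Illustrative CPU-only LoRA fine-tune on a 300-row Dolly slice (not LoFT's code).
# Assumptions: TinyLlama base model, r=8 adapters on q/v projections, 2 epochs.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"   # assumed base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)  # runs on CPU if no GPU is present

# Wrap the base model with small LoRA adapters; only the adapter weights are trained.
model = get_peft_model(model, LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                                         target_modules=["q_proj", "v_proj"],
                                         task_type="CAUSAL_LM"))

# 300-row Dolly slice, formatted as plain instruction/response text.
data = load_dataset("databricks/databricks-dolly-15k", split="train[:300]")

def to_features(row):
    text = f"### Instruction:\n{row['instruction']}\n\n### Response:\n{row['response']}"
    return tokenizer(text, truncation=True, max_length=512)

data = data.map(to_features, remove_columns=data.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="loft-adapter", num_train_epochs=2,
                           per_device_train_batch_size=1, gradient_accumulation_steps=8,
                           learning_rate=2e-4, logging_steps=10),
    train_dataset=data,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("loft-adapter")   # saves only the LoRA adapter weights
```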
Why this matters:
- Makes local LLM customization accessible to devs without GPU access
- Enables domain-specific agents (summarizer, support bot, Q&A) on commodity laptops
- Everything runs on the CPU (no CUDA, no cloud)
Would love feedback on:
- UX improvements or edge cases
- Adapter recipes you’d want (legal, summarization, customer support, etc.)
- Cool things you’d build with low-RAM LLMs
MIT-licensed, 100% local. Feedback is very welcome.
– Diptanshu
One thing that surprised me: on an 8 GB M2 Air, peak RAM never exceeded 330 MB during a full 300-sample finetune (2 epochs), thanks to gradient checkpointing, which reduces memory usage by recomputing activations instead of storing them.
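If you want to reproduce that effect outside LoFT, gradient checkpointing is a one-line toggle in Hugging Face transformers. A minimal sketch (the model name is just an example, and this is not necessarily how LoFT wires it up):

```python
# Illustrative only: enabling gradient checkpointing on a Hugging Face causal LM.
# Activations are recomputed during the backward pass instead of being stored,
# trading extra compute for a much smaller peak-memory footprint.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")
model.gradient_checkpointing_enable()   # recompute activations during backward
model.config.use_cache = False          # the KV cache is incompatible with checkpointing
```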
If anyone tries LoFT on Windows or Linux, I’d love to hear your first-token latency with `loft chat`. On macOS I see ~145 ms/token with TinyLlama + GGUF.
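For anyone comparing numbers, here is one generic way to time the first token against a quantized GGUF file using llama-cpp-python. This is a hedged sketch for benchmarking, not how `loft chat` reports its numbers, and the model path is a placeholder for your own Q4_0 export:

```python
# Rough time-to-first-token measurement against a quantized GGUF model.
# Uses llama-cpp-python; replace the model path with your own Q4_0 export.
import time
from llama_cpp import Llama

llm = Llama(model_path="tinyllama-q4_0.gguf", n_ctx=2048, verbose=False)

start = time.perf_counter()
first_token_at = None
for chunk in llm("Summarize LoRA in one sentence.", max_tokens=64, stream=True):
    if first_token_at is None:
        first_token_at = time.perf_counter()   # wall-clock time when the first token arrived

print(f"first token: {(first_token_at - start) * 1000:.0f} ms")
```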