Taking Notes Effectively – which words should you write down? (youtube.com)

1 points by Anon84 1m ago 0 comments

We Help Early-Stage Founders Avoid IRS Penalties by Automating Tax Reminders (taxhero.vc)

1 points by salleisha 3m ago 1 comments

Finite Vector Fields Visualized (yons.ch)

1 points by andersource 3m ago 0 comments

Ask HN: How to Work with Seagull Principals?

1 points by mystickphoenix 5m ago 0 comments

What Is LLM Tokenization and Why Is It Important? (medium.com)

1 points by tokfan 5m ago 0 comments

Pypistats.org Is Down (github.com)

1 points by runningmike 5m ago 0 comments

I've built an easy to use and universal Price Tracker for Chrome (chromewebstore.google.com)

1 points by SBDomains 6m ago 1 comments

A Supreme Court Coup D'Etat in Brazil (wsj.com)

2 points by matheusmoreira 7m ago 1 comments

Built first postcrawl legal data license, solves Perplexity vs. Cloudflare (aiprivacylicense.com)

1 points by nabanitade 8m ago 0 comments

GitHub: Build secure and scalable remote MCP servers (github.blog)

1 points by saqadri 10m ago 0 comments

FedRAMP government cloud software approvals double under new program (theregister.com)

1 points by rntn 11m ago 0 comments

Tell HN: GPT means I farted in French

2 points by somewhatrandom9 12m ago 0 comments

I've seen 12 people hospitalized after losing touch with reality because of AI (twitter.com)

3 points by fortran77 12m ago 0 comments

Starbucks Korea bans customers from bringing desktops, printers (upi.com)

1 points by sugarpimpdorsey 13m ago 0 comments

Cupid: For Joyful Coding (2022) (dannorth.net)

2 points by todsacerdoti 16m ago 0 comments

Run Whisper audio transcriptions with one FFmpeg command (medium.com)

3 points by wmf 19m ago 0 comments

An Affordable DIY Plug-and-Play Nucleic Acid Fluorometer for EDNA Quantification (biorxiv.org)

1 points by PaulHoule 20m ago 0 comments

WildChat-4.8M: 4.8M Real User–ChatGPT Conversations (Open Dataset) (huggingface.co)

2 points by yuntian 20m ago 1 comments

Nova Scotia bans hiking and use of vehicles in woods due to wildfire fears (cbc.ca)

3 points by zahlman 21m ago 1 comments

Diabetic Man with Gene-Edited Cells Makes Insulin–No Transplant Drugs Required (gizmodo.com)

1 points by dduugg 23m ago 0 comments

Snapshots of Kids Bike Jumping in the 1970s (flashbak.com)

1 points by bookofjoe 25m ago 0 comments

Debian 13 arrives with major updates for Linux users – what's new in 'Trixie' (zdnet.com)

1 points by CrankyBear 25m ago 0 comments

Summer 2025 a cavalcade of climate extremes (phys.org)

1 points by panarchy 25m ago 0 comments

U.S. Marches Toward State Capitalism with American Characteristics (wsj.com)

4 points by JumpCrisscross 25m ago 3 comments

Toxic Shale Drilling Wastewater Threatens Top Oil Fields, Texas Agency Warns (bloomberg.com)

1 points by JumpCrisscross 26m ago 0 comments

I Gave Up Trying to Run Gemma 3 27B on AWS P3dn.24xlarge (medium.com)

1 points by live_alone 27m ago 0 comments

GM Plans Renewed Driverless-Car Push After Cruise Debacle (bloomberg.com)

1 points by JumpCrisscross 27m ago 0 comments

Unix: Making Computers Easier to Use – AT&T Archives, Bell Laboratories (1982) [video] (youtube.com)

3 points by doener 29m ago 0 comments

1968 "Mother of All Demos" by SRI's Doug Engelbart and Team [video] (youtube.com)

1 points by doener 30m ago 0 comments

Blue whales suddenly going silent. Why they think it's happening (aol.com)

1 points by Bluestein 32m ago 0 comments

Human Data Is (Probably) More Expensive Than Compute for Training Frontier LLMs (ddkang.substack.com)

1 points by plokker 32m ago 0 comments

SvelteKit Experimental Remote Functions (svelte.dev)

2 points by angelmm 34m ago 0 comments

Mosfet Dreams (blog.ajith.fyi)

1 points by notmysql_ 34m ago 0 comments

The AI Bandwidth Wall and Co-Packaged Optics [video] (youtube.com)

1 points by nabla9 36m ago 0 comments

More Than a Safety Net – Value Created by Unemployment Benefits (nominalnews.com)

1 points by MPLan 38m ago 1 comments

DEA agent used IL cop's Flock ALPR password for immigration enforcement searches (unraveledpress.com)

1 points by chaps 40m ago 0 comments

I Made a Floppy Disk from Scratch [video] (youtube.com)

4 points by armandososa 41m ago 0 comments

Ask HN: Could we pool our machines to run more powerful AI models locally?

1 points by floweronthehill 42m ago 1 comments

DEF Con 33 Top Talk Titles (yashthapliyal.com)

1 points by yash1hi 42m ago 1 comments

US scrambles to recoup $1M+ nicked by NORKs (theregister.com)

2 points by rntn 43m ago 0 comments

Linux Foundation forces 'woke' inclusive language rules on developers (nerds.xyz)

2 points by BeauNer 46m ago 1 comments

Ask HN: Transition from back end to programming language research

1 points by akkad33 49m ago 0 comments

Missing.css 1.2.0 (missing.style)

2 points by todsacerdoti 49m ago 0 comments

Freedesktop.org – Desktop Interoperability Standards (freedesktop.org)

1 points by EPendragon 51m ago 0 comments

The LattePanda Mu (taoofmac.com)

1 points by speckx 51m ago 0 comments

We're becoming more forgiving when algorithms mess up (theconversation.com)

1 points by geox 52m ago 0 comments

Bcachefs to be removed from mainline Linux kernel (lore.kernel.org)

8 points by rastignack 52m ago 0 comments

Show HN: I built an open-source workout tracker app in 4 days with Claude Code (github.com)

1 points by randomlylelo 54m ago 0 comments

Trout Tickling (en.wikipedia.org)

1 points by koolba 54m ago 0 comments

AWS Lambda GitHub Actions function deployment (aws.amazon.com)

2 points by EPendragon 55m ago 0 comments

Ollama and gguf

18 indigodaddy 6 8/11/2025, 5:54:08 PM github.com ↗

Comments (6)

llmthrowaway · 33s ago

Confusing title - thought this was about Ollama finally supporting sharded GGUF (ie. the Huggingface default for large gguf over 48gb).

https://github.com/ollama/ollama/issues/5245

Sadly it is not and the issue still remains open after over a year meaning ollama cannot run the latest SOTA open source models unless they covert them to their proprietary format which they do not consistently do.

No surprise I guess given they've taken VC money, refuse to properly attribute the use things like llama.cpp and ggml, have their own model format for.. reasons? and have over 1800 open issues...

Llama-server, ramallama or whatever model switcher ggerganov is working on (he showed previews recently) feel like the way forward.

indigodaddy · 3h ago

ggerganov explains the issue: https://github.com/ollama/ollama/issues/11714#issuecomment-3...

polotics · 2m ago

ggerganov is my hero, and... it's a good thing this got posted so I saw in the comments that --flash-attn --cache-reuse 256 could help with my setup (M3 36GB + RPC to M116GB) figuring out what params to set and at what value is a lot of trial and error, Gemini does help a bit clarify what params like top-k are going to do in practice. Still the whole load-balancing with RPC is something I think I'm going to have to read the source of llama.cpp to really understand (oops I almost wrote grok, damn you Elon) Anyways ollama is still not doing distributed load, and yeah I guess using it is a stepping stone...

magicalhippo · 2h ago

I noticed it the other way, llama.cpp failed to download the Ollama-downloaded gpt-oss 20b model. Thought it was odd given all the others I tried worked fine.

Figured it had to be Ollama doing Ollama things, seems that was indeed the case.

dcreater · 2h ago

I think the title buries the lede? Its specific to GPT-OSS and exposes the shady stuff Ollama is doing to acquiesce/curry favor/partner with/get paid by corporate interests

freedomben · 38m ago

I think "shady" is a little too harsh - sounds like they forked an important upstream project, made incompatible changes that they didn't push upstream or even communicate with upstream about, and now have to deal with the consequences of that. If that's "shady" (despite being all out in the open) then nearly every company I've worked for has been "shady."