Who does your assistant serve?

43 points by todsacerdoti | 13 comments | 8/17/2025, 3:14:02 PM | xeiaso.net ↗

Comments (13)

diggan · 3m ago
> Not to mention, industry consensus is that the "smallest good" models start out at 70-120 billion parameters. At a 64k token window, that easily gets into the 80+ gigabyte of video memory range, which is completely unsustainable for individuals to host themselves.

Worth a tiny addendum: GPT-OSS-120B (at mxfp4 with a 131,072-token context) lands at roughly 65GB of VRAM, which is still large but at least less than 80GB. With 2x 32GB GPUs (like the R9700, ~1300USD each) and a slightly smaller context (or KV cache quantization), I feel like you could fit it, which makes it a bit more obtainable for individuals. The 120B with reasoning_effort set to high is quite good as far as I've tested it, and blazing fast too.
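For anyone who wants to sanity-check these numbers, the footprint is roughly weights plus KV cache. A minimal back-of-envelope sketch, using assumed (not official) figures: ~4.25 bits/weight for mxfp4-style quantization, and GPT-OSS-120B-like attention dimensions (36 layers, 8 KV heads, head dim 64 are my assumptions here):

```python
# Back-of-envelope VRAM estimate: quantized weights + KV cache.
# All model figures are illustrative assumptions, not official specs.

def weights_gb(params_billion, bits_per_weight):
    """Memory for the quantized weights, in GB."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

def kv_cache_gb(layers, kv_heads, head_dim, context_tokens, kv_bits=16):
    """KV cache: K and V, per layer, per KV head, per token."""
    return 2 * layers * kv_heads * head_dim * context_tokens * kv_bits / 8 / 1e9

w = weights_gb(120, 4.25)                         # ~63.8 GB at mxfp4-like density
kv16 = kv_cache_gb(36, 8, 64, 131_072)            # ~9.7 GB at fp16 KV
kv8 = kv_cache_gb(36, 8, 64, 131_072, kv_bits=8)  # ~4.8 GB with 8-bit KV cache
print(f"weights {w:.1f} GB, KV fp16 {kv16:.1f} GB, KV int8 {kv8:.1f} GB")
```

Which is exactly why KV cache quantization or a smaller context is what buys you the last few gigabytes when squeezing into 2x 32GB.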

fleebee · 6m ago
What's worth noting is that the companies providing LLMs are also strongly pushing people into using their LLMs in unhealthy ways. Facebook has started shoving their conversational chatbots into people's faces.[1] That none of the big companies are condemning or blocking this kind of LLM usage -- but are in fact advocating for it -- is telling of their priorities. Evil is not a word I use lightly but I think we've reached that point.

[1]: https://www.reuters.com/investigates/special-report/meta-ai-...

diggan · 1m ago
> Evil is not a word I use lightly but I think we've reached that point.

The writing was on the wall as soon as Meta started posting publicly about AI personalities/profiles on Instagram, or however it started. If I recall correctly, they announced it more than two years ago?

Aurornis · 25m ago
> I feel like this should go without saying, but really, do not use an AI model as a replacement for therapy.

I know several people who rave about ChatGPT as a pseudo-therapist, but from the outside the results aren’t encouraging. They like the availability and openness they experience by talking to a non-human, but they also like the fact that they can get it to say what they want to hear. It’s less of a therapist and more of a personal validation machine.

Want to feel like the victim in every situation, have a virtual therapist tell you that everything is someone else’s fault, and have the choices you made validated? Spend a few hours with ChatGPT and you’ll learn how to get it to respond the way you want. If you really don’t like the direction a conversation is going, you delete it and start over, reshaping the inputs to steer it the way you want.

Any halfway decent therapist will spot these behaviors and at least not encourage them. LLM therapists seem to spot these behaviors and give the user what they want to hear.

Note that I’m not saying it’s all bad. They seem to help some people work through certain issues, rubber duck debugging style. The trap is seeing this success a few times and assuming it’s all good advice, without realizing it’s a mirror for your inputs.

Hackbraten · 41m ago
> At a 64k token window, that easily gets into the 80+ gigabyte of video memory range, which is completely unsustainable for individuals to host themselves.

A desktop computer in that performance tier (e.g. an AMD AI Max+ 395 with 128 GB of shared memory) is expensive but not prohibitively so. Depending on where you live, one year of therapy may cost more than that.

jchw · 31m ago
It seems like the Framework Desktop has become one of the best choices for local AI on the whole market. For a bit over $2,000 you can get a machine with, if I understand correctly, around 120 GiB of accessible VRAM and the seemingly brutal Radeon 8060S, whose iGPU performance appears to be challenged only by a fully loaded Apple M4 Max or, of course, a sufficiently big dGPU. The previous best options seemed to be from Apple, but for a similar amount of VRAM I can't find a similarly good deal. (The last time I could find an Apple Silicon device selling for ~$2,000 with that much RAM on eBay, it was an M1 Ultra.)

I am not really dying to run local AI workloads, but the prospect of being able to play with larger models is tempting. It's not $2,000 tempting, but tempting.

layer8 · 9m ago
There are a dozen or more (mostly Chinese) manufacturers coming out with mini PCs based on that Ryzen AI Max+ 395 platform, for example the Bosgame M5 AI Mini at just $1699 with 128GB. Just pointing out that this configuration is not a Framework exclusive.

Aurornis · 23m ago
FYI there are a number of Strix Halo boards and computers out in the market already. The Framework version looks to be high quality and have good support, but it’s not the only option in this space.

Also take a good hard look at the token output speeds before investing. If you’re expecting quality, context windows, and output speeds similar to the hosted providers you’re probably going to be disappointed. There are a lot of tradeoffs with a local machine.
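A rough way to reason about those output speeds: single-stream decoding is usually memory-bandwidth-bound, so each generated token costs roughly one pass over the active weights. A sketch with assumed figures (a ~256 GB/s Strix Halo-class memory system; the active-weight sizes per token are illustrative guesses, not benchmarks):

```python
# Back-of-envelope decode speed for a bandwidth-bound local LLM.
# tokens/sec ≈ memory bandwidth / bytes streamed per generated token.
# All hardware and model figures below are assumptions for illustration.

def tokens_per_sec(bandwidth_gb_s, active_gb_per_token):
    """Upper-bound decode rate when limited by memory bandwidth."""
    return bandwidth_gb_s / active_gb_per_token

# ~256 GB/s assumed for a Strix Halo-class shared-memory system.
dense_70b = tokens_per_sec(256, 35.0)  # dense 70B at 4-bit: ~35 GB/token
moe_5b = tokens_per_sec(256, 2.7)      # MoE with ~5B active params: ~2.7 GB/token
print(f"dense 70B: ~{dense_70b:.0f} tok/s, sparse MoE: ~{moe_5b:.0f} tok/s")
```

So a dense 70B model lands in the single-digit tok/s range on this class of hardware, while sparse MoE models can be an order of magnitude faster; prompt processing speed and long-context behavior are separate bottlenecks again.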

walterbell · 14m ago
HP Z2 Mini G1a with 128GB and Strix Halo is ~$5K, https://www.notebookcheck.net/Z2-Mini-G1a-HP-reveals-compara...

alistairSH · 57m ago
I can’t help but think we’re accelerating our way to a truly dystopian future. Like Blade Runner, but worse, maybe.

troupo · 42m ago
We're already in the early stages of Blade Runner.

bit1993 · 22m ago
We should not forget that LLMs simply replicate the data humans have put on the WWW. LLM tech could only have come out of Google search, which indexed and collected the entire WWW; the natural next step was to develop algorithms to understand that data and give better search results. This also shows the weakness of LLMs: they depend on human data, and as LLM companies keep trying to replace humans, humans will simply stop feeding LLMs their data. More and more data will go behind paywalls, and more code will become closed source; simple supply-and-demand economics. LLMs cannot make progress without new data, because world culture moves rapidly in real time.

walterbell · 18m ago
> LLMs cannot make progress without new data because the world-culture moves rapidly in real-time.

This helps services where users generate content: it reduces both the licensing cost and the latency of accessing external content.