Vibechart (vibechart.net)

It's quite ironic that the technology that is displacing so many people from so many industries, has yet to make a profit. I fear the "creative" part of their destruction will take longer to achieve than they advertise.

belter · 42m ago

Nobody has lost their job yet because of AI. But lots of people lost their job, because of the money their CEOs spent on AI.

darth_avocado · 2h ago

What is worse? Terrible charts, terrible charts making it through any form of scrutiny or terrible charts intentionally making it to the main stage.

abeppu · 2h ago

> OpenAI conveniently forgot to include this comparison (ARC-AGI-2) in their livestream recital of benchmark progress, which left the livestream looking like marketing rather than science.

Yeah but it was _supposed_ to be marketing, right? Like, of course a product video isn't science in the same way a "hot take" post also isn't science.

dude250711 · 1h ago

Why is Grok so surprisingly decent? Does lack of mainstream liberal-left censorship (replaced with Musky censorship) result in some sort of a weird performance boost?

pupppet · 1h ago

Is it decent, or does it game the tests? Really, would love to know..

hagbard_c · 10m ago

There's nothing weird about a model performing better when it is built to more closely relate to reality instead of an ideologically tainted version of such. I don't know how much Musk & Co. interfere with the fine tuning of the models but it is clear that this interference is far less heavy-handed than what the other actors do to their models.

slowmovintarget · 52m ago

Yes, actually.

Fewer fingers on the scale means the LLM gets to actually do its thing. GPT-4 with zero filtering was scary smart according to the red teams that were testing it. The version the public got had a lobe tied behind it's back.

Having only Grok 3 to compare, and toying around with GPT-5... GPT-5 is pretty good.

Vibechart (vibechart.net)

GPT-5 (openai.com)

Historical Tech Tree (historicaltechtree.com)

GPT-5: Key characteristics, pricing and system card (simonwillison.net)

Flipper Zero DarkWeb Firmware Bypasses Rolling Code Security (rtl-sdr.com)

GPT-5 for Developers (openai.com)

OpenAI's new open-source model is basically Phi-5 (seangoedecke.com)

Encryption made for police and military radios may be easily cracked (wired.com)

Benchmark Framework Desktop Mainboard and 4-node cluster (github.com)

Cursor CLI (cursor.com)

Building Bluesky comments for my blog (natalie.sh)

Windows XP Professional (win32.run)

Infinite Pixels (meyerweb.com)

How to sell if your user is not the buyer (writings.founderlabs.io)

Show HN: Octofriend, a cute coding agent that can swap between GPT-5 and Claude (github.com)

Open music foundation models for full-song generation (map-yue.github.io)

How AI conquered the US economy: A visual FAQ (derekthompson.org)

Foundry (YC F24) is hiring staff-level product engineers (ycombinator.com)

The Inkhaven Blogging Residency (inkhaven.blog)

Squashing my dumb bugs and why I log build IDs (rachelbythebay.com)

Spatio-temporal indexing the Bluesky firehose (joelgustafson.com)

Gemini CLI GitHub Actions (blog.google)

Achieving 10,000x training data reduction with high-fidelity labels (research.google)

Lightweight LSAT (lightweightlsat.com)

The Q Programming Language (git.urbach.dev)

Show HN: Browser AI agent platform designed for reliability (github.com)

DNA tests are uncovering the true prevalence of incest (2024) (theatlantic.com)

Monte Carlo Crash Course: Quasi-Monte Carlo (thenumb.at)

An LLM does not need to understand MCP (hackteam.io)

Leonardo Chiariglione – Co-founder of MPEG (leonardo.chiariglione.org)

Zero-day flaws in authentication, identity, authorization in HashiCorp Vault (cyata.ai)

A generic non-invasive neuromotor interface for human-computer interaction (nature.com)

Claude Code IDE integration for Emacs (github.com)

The Bus Station That Didn't Exist, and Other Data Epiphanies (nightingaledvs.com)

The Sunlight Budget of Earth (asimov.press)

Show HN: Stasher – Burn-after-read secrets from the CLI, no server, no trust (github.com)

Eggs are off the hook–study reveals bacon's the real heart risk (sciencedaily.com)

Jepsen: Capela dda5892 (jepsen.io)

Arm desktop: emulation (marcin.juszkiewicz.com.pl)

Brazil movie: as prescient as ever, 40 years later (theverge.com)

Preventing ZIP parser confusion attacks on Python package installers (blog.pypi.org)

Lithium compound can reverse Alzheimer’s in mice: study (hms.harvard.edu)

Laptop Support and Usability (LSU): July 2025 Report (github.com)

The Whispering Earring (croissanthology.com)

Splatshop: Efficiently Editing Large Gaussian Splat Models (momentsingraphics.de)

79% of OpenBSD kernel source is AMD DRM (marc.info)

Koalas vs. Crows: An evolutionary theory of software (ajmoon.com)

Debounce (developer.mozilla.org)

Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs (baseten.co)

Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model (github.com)

GPT-5 Hot Take

Comments (8)