Top Stories

VoidWhisperer · 1d ago

Given that they've shown they don't shy away from making questionable changes to Grok's main system prompt (i.e. the one telling it not to worry about being politically incorrect, which subsequently led to it being pushed into making extremely antisemitic comments)[0], I would hope any company approaches this with that wariness in mind.

[0]: https://techcrunch.com/2025/07/09/x-takes-grok-offline-chang...

andsoitis · 1d ago

The nonprofit Arc Prize says that Grok achieves a new state-of-the-art score on its ARC-AGI-2 test — another difficult benchmark that consists of puzzle-like problems where an AI has to identify visual patterns — scoring 16.2%. That’s nearly twice the score of the next best commercial AI model, Claude Opus 4.

simianwords · 1d ago

How can I try out Grok 4 Heavy? I can't find API or OpenRouter access.

ksynwa · 1d ago

What's the model that the @grok twitter account uses?

bravetraveler · 1d ago

:latest

I kid but it's roughly that predictable; whatever was pushed last.

Top DNS domains seen on the Quad9 recursive resolver array each day (github.com)

Show HN: Vibe Kanban – Kanban board to manage your AI coding agents (github.com)

Bill Atkinson's psychedelic user interface (patternproject.substack.com)

Astronomers race to study interstellar interloper (science.org)

Repaste Your MacBook (christianselig.com)

Turmeric is the culprit in a global lead poisoning mystery (2024) (npr.org)

At Least 13 People Died by Suicide Amid U.K. Post Office Scandal, Report Says (nytimes.com)

Andrew Ng: Building Faster with AI [video] (youtube.com)

Upgrading an M4 Pro Mac mini's storage for half the price (jeffgeerling.com)

Pa. House passes 'click-to-cancel' subscription bills (pennlive.com)

AI agent benchmarks are broken (ddkang.substack.com)

Overtourism in Japan, and how it hurts small businesses (craigmod.com)

Show HN: Pangolin – Open source alternative to Cloudflare Tunnels (github.com)

Kimi K2 (twitter.com)

In a First, Solar Was Europe's Biggest Source of Power Last Month (e360.yale.edu)

U.S. abandons hunt for signal of cosmic inflation (science.org)

OpenFront: Realtime Risk-like multiplayer game in the browser (openfront.io)

Recovering from AI Addiction (internetaddictsanonymous.org)

The day someone created 184 billion Bitcoin (2020) (decrypt.co)

LLM Inference Handbook (bentoml.com)

The ChompSaw: A benchtop power tool that's safe for kids to use (core77.com)

I'm done with social media – Or: why I have a blog now (carolinecrampton.com)

Postgres LISTEN/NOTIFY does not scale (recall.ai)

Some arguments against a land value tax (2024) (lesswrong.com)

FP8 is ~100 tflops faster when the kernel name has "cutlass" in it (twitter.com)

Batch Mode in the Gemini API: Process More for Less (developers.googleblog.com)

Things I learned from 5 years at Vercel (leerob.com)

Show HN: Interactive pinout for the Raspberry Pi Pico 2 (pico2.pinout.xyz)

Underwater turbine spinning for 6 years off Scotland's coast is a breakthrough (apnews.com)

Btrfs Allocator Hints (lwn.net)

What is Realtalk’s relationship to AI? (2024) (dynamicland.org)

Walking every street in New York City (imjustwalkin.com)

FOKS: Federated Open Key Service (foks.pub)

Apple vs the Law (formularsumo.co.uk)

Flix – A powerful effect-oriented programming language (flix.dev)

Graphical Linear Algebra (graphicallinearalgebra.net)

The Complete MCP Experience: Full Specification Support in VS Code (code.visualstudio.com)

Red Hat Technical Writing Style Guide (stylepedia.net)

Series of posts on HTTP status codes (2018) (evertpot.com)

Anthropic Is Bleeding Out (wheresyoured.at)

Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)" (simonwillison.net)

Show HN: Open source alternative to Perplexity Comet (browseros.com)

At Amazon's biggest data center, everything is supersized for AI (nytimes.com)

Operational Apple-1 Computer for sale [video] (youtube.com)

Analyzing database trends through 1.8M Hacker News headlines (camelai.com)

Diffsitter – A Tree-sitter based AST difftool to get meaningful semantic diffs (github.com)

Orwell Diaries 1938-1942 (orwelldiaries.wordpress.com)

Gut microbes could protect us from toxic 'forever chemicals' (cam.ac.uk)

Measuring the impact of AI on experienced open-source developer productivity (metr.org)

Launch HN: Leaping (YC W25) – Self-Improving Voice AI
