OpenAI claiming gold medal standard at IMO 2025

8 points · ocfnash · 7 comments · 7/19/2025, 9:53:34 AM · github.com ↗

Comments (7)

Davidzheng · 4h ago
The proofs superficially look super interesting, especially because they're not in the style of the usual LLM filler. It's almost exactly the opposite: very efficient use of words, dropping any grammar that isn't important. It reminds me of how people write down proofs in drafts, or how we communicate proofs to peers before writing the final version.
Davidzheng · 4h ago
P1's setup section has basically a very precise summary of the proof, which it fills in later: "So main is: (a) for n>=4, any n-line cover must contain a side-line; inductively reduce to n=3. (b) Analyze n=3 exactly."

I suspect there's some (tree-based?) search + a separate process verifier + a large number of parallel generation sessions, judging just from hints of how structured/monotone the generated text is.

A lot of colons, like "So:", "Now:", "Need:", etc.
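
A minimal sketch of the kind of pipeline I'm imagining, purely to illustrate "many parallel sessions + a separate verifier" (every name and the scoring step here is made up, not anything OpenAI has described):

    import hashlib

    def generate_candidate(problem: str, seed: int) -> str:
        # Stand-in for one independent generation session.
        return f"candidate proof #{seed} for: {problem}"

    def verify(proof: str) -> float:
        # Stand-in for a separate process verifier; here just a pseudo-score
        # derived from a hash so the example runs deterministically.
        return (int(hashlib.sha256(proof.encode()).hexdigest(), 16) % 1000) / 1000

    def best_of_n(problem: str, n: int = 32) -> str:
        # Run n "sessions" (sequentially here) and keep the best-scoring candidate.
        candidates = [generate_candidate(problem, s) for s in range(n)]
        return max(candidates, key=verify)

    print(best_of_n("IMO 2025 Problem 1"))

A real system would presumably replace both stand-ins with models and add the search, but the selection structure would look roughly like this.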

Davidzheng · 4h ago
P3: interesting that in the basics section it makes an easy observation but gives no proof sketch, unlike P1/P2 (P1 has a full proof-idea sketch; P2 says we'll bash). This suggests the whole proof is actually generated one-shot (unlike my previous comment). I guess it's not doing search in text space (i.e., output a line, search for the next line, etc.). Of course there's probably some final process that assembles the proof from pieces, so the search could be obscured.

Come to think of it, informal proof generation probably can't easily use search? It's probably doing parallel generation with some information sharing + a global verification process. No real evidence, except that the overall proof is very unstructured even though each individual line is written with some stylistic consistency.

Davidzheng · 4h ago
P2 is geometry. It looks coordinate-bashed? Very interesting to see it writing "Good." and "Perfect." after some lines. Very human-like in its thinking process; it reads like a person talking through the proof orally.
Davidzheng · 4h ago
I posted about one of the Twitter threads at https://news.ycombinator.com/item?id=44613840
ocfnash · 5h ago
According to tweet 6/N of this series, they are claiming full marks for problems 1-5:

https://x.com/alexwei_/status/1946477742855532918

energy123 · 4h ago
This is incredible. We know these questions are not in the training data. How can you still say that LLMs aren't reasoning?