A fundamental problem that we’re still far away from solving is not necessarily that LLMs/LRMs cannot reason the way we do (which I guess should be clear by now), but that they might not have to. They generate slop so fast that, if one can benefit a little from each output, i.e. if you can find a little bit of use hidden beneath the mountain of meaningless text they create, then this might still be more valuable than preemptively taking the time to create something more meaningful to begin with. I can’t say for sure what the reward system behind LLM use in general is, but given how much money people are willing to spend on models even in their current deeply flawed state, I’d say it’s clear that the time savings outweigh the mistakes and shallowness.
Take the comment paper, for example. Since Claude Opus is the first author, I’m assuming that the human author took a backseat and let the AI build the reasoning and most of the writing. Unsurprisingly, it is full of errors and contradictions, to the point where it looks like the human author didn’t bother much to check what was being published. One might say that the human author, in trying to build some reputation by showing that their model could answer a scientific criticism, actually did the opposite: they provided more evidence that their model cannot reason deeply, and maybe hurt their reputation even more.
But the real question is, did they really? How much backlash will they actually get for submitting this to arXiv without checking? Would that backlash keep them from submitting 10 more papers next week with Claude as the first author? If one weighs the amount of slop you can put out (with a slight benefit from each piece) against the bad reputation it earns you, I cannot say that “human thinking” is actually worth it anymore.
iLoveOncall · 1m ago
Mediocre people produce mediocre work. Using AI might make those mediocre people produce even worse work, but I don't think it'll affect competent people who have standards regardless of the available tooling.
If anything the outcome will be good: mediocre people will produce even worse work and will weed themselves out.