FreeBSD Scheduling on Hybrid CPUs (wiki.freebsd.org)

1 points by fntlnz 2m ago 0 comments

Prophet of the Human-Built World: An Introduction to John Ruskin (comment.org)

1 points by rslice 5m ago 0 comments

Porsche Track Precision Gateway reverse engineering (991/981 Generation) (youtube.com)

1 points by artem981 6m ago 1 comments

A Full-Chain Exploit of an Unfused Qualcomm Device (hhj4ck.github.io)

1 points by timschumi 6m ago 0 comments

Made a budgeting app focused on speed and ease of use (budgetmate.ai)

1 points by Kodence 7m ago 1 comments

Linear sent me down a local-first rabbit hole (bytemash.net)

2 points by jcusch 7m ago 0 comments

Line Length (blog.glyph.im)

2 points by Bogdanp 14m ago 0 comments

Now, the Artificial Intelligence can edit files on Linux. 100% privacy [video] (youtube.com)

2 points by grigio 23m ago 0 comments

Take Back Our Digital Infrastructure to Save Democracy (techdirt.com)

4 points by BallsInIt 29m ago 0 comments

Cangjie Programming Language Overview – CodeAbbey (codeabbey.com)

2 points by thunderbong 29m ago 0 comments

Work Shapes Your Freedom (substack.com)

2 points by shadowvoxing 37m ago 0 comments

How Hackers can trick Windows Hello into thinking it's you, break into your PC (neowin.net)

2 points by bundie 38m ago 0 comments

Turn Any Website into an API (parse.bot)

3 points by pcl 42m ago 0 comments

The enduring puzzle of static electricity (pubs.aip.org)

5 points by EvgeniyZh 51m ago 0 comments

Feeding the Slop Machine (youtube.com)

2 points by ap-hyperbole 51m ago 0 comments

Ask HN: Prepping my first open-source release, would you use this?

4 points by lukedwcooper 1h ago 1 comments

Exposed to the Bare Bone: When Private Medical Scans Surface on the Internet (modat.io)

3 points by gnabgib 1h ago 0 comments

Beyond good vibes: Securing AI agents by design (yanirseroussi.com)

3 points by yanir 1h ago 0 comments

New Copilot for Gaming Aims to Save You Time, Help You Get Good (news.xbox.com)

2 points by gnabgib 1h ago 0 comments

Digital Foundry Leaves IGN, Now Independent (digitalfoundry.net)

4 points by zdw 1h ago 0 comments

AI Teammate Has Arrived: Xbox's New Gaming Copilot Is Here (securityonline.info)

2 points by kPwn 1h ago 1 comments

She Has Good Looks and Attractive to Me (etechx.co.ke)

2 points by Manyi 1h ago 0 comments

McKinsey and its peers need a strategic rethink (economist.com)

6 points by petethomas 1h ago 0 comments

GPT-5 Doesn't know it is GPT-5 (imgur.com)

5 points by jablongo 1h ago 2 comments

Show HN: I built a simple tool to automate data into Google Sheets and BigQury (syncrange.com)

3 points by RyanDavid 1h ago 0 comments

Could the U.S. Have Saved Navalny? (wsj.com)

4 points by mudil 1h ago 0 comments

GptApiToOSSMigrator – Migrate OpenAI APIs to Local OSS Models (github.com)

2 points by saurabhyer 1h ago 1 comments

Flycrypto – Book Flights and Hotels with Bitcoin and Crypto

2 points by flycrypto 1h ago 0 comments

Dollar Street – Photos from families with different incomes (gapminder.org)

3 points by uneven9434 1h ago 0 comments

Japan Air Lines Flight 123 (en.wikipedia.org)

1 points by colinprince 2h ago 0 comments

The Potato's Mysterious Family Tree Revealed–and It Includes Tomatoes (scientificamerican.com)

2 points by petethomas 2h ago 0 comments

Ask HN: Other funny public mishaps like OpenAI bar chart? (old.reddit.com)

1 points by bkls 2h ago 1 comments

Blueberry Hill (kieranhealy.org)

3 points by interpol_p 2h ago 0 comments

Digital Pet ID with QR Code – Keep Your Pet Safe and Connected (petidgenerator.com)

2 points by alenguo 2h ago 2 comments

GPT-5 leaked system prompt (gist.github.com)

187 points by maoxiaoke 2h ago 143 comments

Convert your legacy liability into a competitive advantage (legacy-modernization.io)

2 points by mooreds 2h ago 0 comments

The Paranoid Style in American Politics (1964) (harpers.org)

25 points by mitchbob 2h ago 5 comments

3D Printing Radiance Fields (arxiv.org)

2 points by E-Reverance 3h ago 0 comments

GPT-5: "How many times does the letter b appear in blueberry?" (bsky.app)

27 points by minimaxir 3h ago 5 comments

Visionaries Turn into Authoritarians (phys.org)

11 points by gsf_emergency_2 3h ago 3 comments

Supernovas AI – All-in-One Tool to Chat with Every Top AI Model and Your Data (supernovasai.com)

1 points by saljump 3h ago 1 comments

Rails queue adapter for mindful developers. Accepts all jobs, executes none (github.com)

1 points by mooreds 3h ago 0 comments

Ask HN: Other options for better resolution on macOS?

1 points by coro_1 3h ago 0 comments

How did a debate over housing become a call to end the anti-monopoly movement? (thebignewsletter.com)

9 points by jez 3h ago 2 comments

Show HN: Scheduled PC Tasks, automatically schedule simulations of actions on PC (apps.microsoft.com)

1 points by AmirHammoutene 3h ago 0 comments

Vibe Coding: The Hot Hand (twitter.com)

2 points by mmorearty 3h ago 1 comments

New executive order puts all grants under political control (arstechnica.com)

125 points by pbui 3h ago 63 comments

AI Art Resources (jonathandinu.com)

1 points by clearspandex 3h ago 0 comments

Beyond Naive RAG: Practical Advanced Methods by Hamel Husain and Shreya Shankar (maven.com)

1 points by simonpure 3h ago 0 comments

Trump opens door for 401(k) plans to invest in crypto (pbs.org)

9 points by geox 3h ago 0 comments

GPT-5: "How many times does the letter b appear in blueberry?"

27 minimaxir 5 8/8/2025, 2:51:33 AM bsky.app ↗

Comments (5)

axdsk · 1h ago

“It’s like talking to a PhD level expert” -Sam Altman

https://www.youtube.com/live/0Uu_VJeVVfo?si=PJGU-MomCQP1tyPk

schoen · 1h ago

These are always amazing when juxtaposed with apparently impressive LLM reasoning, knowledge, and creativity. You can trivially get them to make the most basic mistakes about words and numbers, and double down on those mistakes, repeatedly explaining that they're totally correct.

Have any systems tried prompting LLMs with a warning like "You don't intuitively or automatically know many facts about words, spelling, or the structure or context of text, when considered as text; for example, you don't intuitively or automatically know how words or other texts are spelled, how many letters they contain, or what the result of applying some code, mechanical transformation, or substitution to a word or text is. Your natural guesses about these subjects are likely to be wrong as a result of how your training doesn't necessarily let you infer correct answers about them. If the content or structure of a word or text, or the result of using a transformation, code, or the like on a text, is a subject of conversation, or you are going to make a claim about it, always use a tool to confirm your intuitions."?

mikestorrent · 29m ago

This is a great idea. Like, if someone asked me to count the number of B's in your paragraph, I'd yeet it through `grep -o 'B' file.txt | wc -l` or similar, why would I sit there counting it by hand?

As a human, if you give me a number on screen like 100000000, I can't be totally sure if that's 100 Million or 1 Billion without getting close and counting carefully. Should ought have my glasses. Mouse pointer helps some as an ersatz thousands-separator, but still.

Since we're giving them tools, especially for math, it makes way more sense to start giving them access to some of the finest tools ever. Make an MCP into Mathematica or Matlab and let the LLM write some math and have classical solvers actually deal with the results. Let the LLM write little bits of bash or python as its primary approach for dealing with these kinds of analytical questions.

It's like giving a kid a calculator...

Erem · 14m ago

With data starvation driving ai companies towards synthetic data I’m surprised that an easily synthesized problem like this hasn’t been trained out of relevance. Yet here we are with proof that it hasn’t

HsuWL · 2h ago

I love this test. Demonstrates the "understanding" process of the language model.