Show HN: QuantumFlow Toolkit – An open-source framework for hybrid quantum workflows (github.com)

2 points by realjjhaxnews 15m ago 0 comments

Show HN: Spatial Web Browser Engine (m-creativelab.github.io)

11 points by yorkie 6h ago 3 comments

Show HN: Andre – A privacy-first, location-aware assistant that helps you (andreapp.org)

2 points by billellsd 2h ago 0 comments

Show HN: Turn impulse buys into dream investments (nopeit.app)

10 points by pjcodes 7h ago 7 comments

Show HN: WebGPU enables local LLM in the browser – demo site with AI chat (andreinwald.github.io)

137 points by andreinwald 1d ago 50 comments

Show HN: Structured Cooperation – A new way of building distributed apps & POC (github.com)

7 points by gabrielshanahan 11h ago 0 comments

Show HN: NameFast – Instantly generate brandable names for your SaaS or startup

2 points by skyzouw 4h ago 1 comment

Show HN: Phlebas, a live timeseries sim controlled by the console (greenvitriol.com)

3 points by janesconference 5h ago 0 comments

Show HN: I Taught an LSTM to Trade So I Could Sleep Better at Night (wolflux.site)

4 points by Neshanth 6h ago 3 comments

Show HN: Draw a fish and watch it swim with the others (drawafish.com)

911 points by hallak 5d ago 232 comments

Show HN: NaturalCron – Human-Readable Scheduling for .NET (With Fluent Builder) (github.com)

40 points by hugoj0s3 1d ago 12 comments

Show HN: Zomni – An AI sleep coach that personalizes CBT-I for everyday use (apps.apple.com)

3 points by deni_marina 7h ago 0 comments

Show HN: Enforce TDD in Claude Code (github.com)

3 points by Nizoss 8h ago 1 comment

Show HN: Wordle-style game for Fermi questions (fermiquestions.org)

31 points by danielfetz 1d ago 33 comments

Show HN: Voltpeek – Vim-inspired oscilloscope software (github.com)

12 points by schuyler4 1d ago 1 comment

Show HN: Mcp-use – Connect any LLM to any MCP (github.com)

153 points by pzullo 3d ago 68 comments

Show HN: An interactive dashboard to explore NYC rentals data (leaseswap.nyc)

70 points by giulioco 4d ago 51 comments

Show HN: AgentMail – Email infra for AI agents (chat.agentmail.to)

116 points by Haakam21 3d ago 68 comments

Show HN: TraceRoot – Open-source agentic debugging for distributed services (github.com)

40 points by xinweihe 2d ago 16 comments

Show HN: I made a website that makes you cry (cryonceaweek.com)

293 points by johnnymaroney 7d ago 234 comments

Show HN: Pontoon – Open-source customer data syncs (github.com)

44 points by alexdriedger 2d ago 10 comments

Show HN: Rewindtty – Record and replay terminal sessions as structured JSON (github.com)

34 points by debba 5d ago 14 comments

Show HN: AI Physics Tutor with Free Body Diagrams (physicsviewer.com)

6 points by andrewrn 1d ago 0 comments

Show HN: Sourcebot – Self-hosted Perplexity for your codebase (github.com)

101 points by bshzzle 4d ago 27 comments

Show HN: KubeForge – A GUI for Kubernetes YAMLs (github.com)

62 points by rakeda 2d ago 23 comments

Show HN: Print the daily weather forecast on a thermal receipt printer (github.com)

21 points by chr15m 4d ago 6 comments

Show HN: An AI agent that learns your product and guides your users (frigade.ai)

69 points by pancomplex 4d ago 29 comments

Show HN: Mathpad – Physical keypad for typing 100+ math symbols anywhere (crowdsupply.com)

7 points by MagneLauritzen 1d ago 9 comments

Show HN: Companies use AI to take your calls. I built AI to make them for you (pipervoice.com)

231 points by michaelphi 6d ago 173 comments

Show HN: Open-source alternative to ChatGPT Agents for browsing (github.com)

103 points by ElasticBottle 4d ago 23 comments

Show HN: Dlg – Zero-cost printf-style debugging for Go (github.com)

64 points by 0xFEE1DEAD 7d ago 39 comments

Show HN: Realtime Magic-Eye Mirror (namuol.github.io)

2 points by namuol 1d ago 0 comments

Show HN: The Aria Programming Language (github.com)

48 points by egranata_aria 7d ago 14 comments

Show HN: Fetchet – A compact, promise-based, HTTP fetch wrapper (github.com)

2 points by Brysonbw 1d ago 3 comments

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com)

124 points by Danau5tin 5d ago 12 comments

Show HN: A high-altitude low-power flight computer for high-altitude balloons (github.com)

42 points by mpkendall 4d ago 22 comments

Show HN: Online Ruler – Measuring in inches/centimeters (anruler.com)

88 points by artiomyak 7d ago 61 comments

Show HN: Use Their ID – Use your local UK MP’s ID for the Online Safety Act (use-their-id.com)

856 points by timje1 6d ago 279 comments

Show HN: A GitHub Action that quizzes you on a pull request (github.com)

94 points by dkamm 5d ago 33 comments

Show HN: Fast Elevation API with memory mapped tiles (terraintap.com)

2 points by anaj123 1d ago 0 comments

Show HN: Agentic AI Frameworks on AWS (LangGraph,Strands,CrewAI,Arize,Mem0) (github.com)

6 points by thinkagenticai 2d ago 0 comments

Show HN: Open-sourced my prompt management tool for LLM-powered apps (github.com)

3 points by piterrro 1d ago 1 comment

Show HN: Cant – Library written in Rust that provides PyTorch-like functionality (github.com)

50 points by TuckerBMorgan 7d ago 5 comments

Show HN: Convert from MIDI file to ASCII tablature (and more) (github.com)

42 points by ycombiredd 8d ago 12 comments

Show HN: Astro dev blog template with interactive colorschemes (multiterm.stelclementine.com)

31 points by stelcodes 3d ago 4 comments

Show HN: Open-source physical rack-mounted GUI for home lab (getubo.com)

41 points by mmajzoobi 7d ago 3 comments

Show HN: MoebiusXBIN – ASCII and text-mode art editor with custom font support (blog.glyphdrawing.club)

55 points by california-og 4d ago 6 comments

Show HN: Walk-through of rocket landing optimization paper [pdf] (scpowers.github.io)

16 points by scpowers 5d ago 1 comment

Show HN: I built an AI that turns any book into a text adventure game (kathaaverse.com)

280 points by rcrKnight 5d ago 109 comments

Show HN: F1 COSMOS – Live timing and data dashboard for F1 fans (f1cosmos.com)

8 points by conradmk 1d ago 12 comments

Why do LLMs still not run code before giving it to you?

1 point by highfrequency · 3 comments · 8/3/2025, 7:58:37 PM

The leading models all advertise tool use, including code execution. So why is it still common to receive a short Python script containing a logic bug that would be discovered immediately by running it in a Python interpreter for 0.1 seconds? Is it a safety concern, or the difficulty of sandboxing in a VM? Surely it is not a resource consumption issue, given the price of a single CPU core vs. a GPU.
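To make the idea concrete, here is a rough sketch of the kind of pre-flight check being asked about: write the generated script to a temporary directory and run it in a fresh interpreter with a timeout. The function name `run_generated_script` and the 5-second default are illustrative assumptions, not anything a model vendor is known to do, and a subprocess on its own is not a security sandbox.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_generated_script(source: str, timeout_s: float = 5.0) -> tuple[bool, str]:
    """Run `source` in a fresh interpreter; return (ok, combined output).

    Hypothetical helper for illustration. This catches crashes and hangs,
    but it does NOT isolate the code from the host machine.
    """
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "candidate.py"
        script.write_text(source, encoding="utf-8")
        try:
            proc = subprocess.run(
                [sys.executable, "-I", str(script)],  # -I: isolated mode
                capture_output=True,
                text=True,
                timeout=timeout_s,
                cwd=tmp,
            )
        except subprocess.TimeoutExpired:
            return False, f"timed out after {timeout_s} s"
    # A non-zero exit code means the script raised or called sys.exit(n != 0).
    return proc.returncode == 0, proc.stdout + proc.stderr


if __name__ == "__main__":
    ok, output = run_generated_script("print(undefined_name)")
    print(ok)
    print(output)
```

A check like this only surfaces exceptions and timeouts; a script that runs cleanly but computes the wrong answer still needs tests with known expected values.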

Comments (3)

tlb · 5h ago

Is it a common use case to produce a standalone program that could be tested in isolation? Usually I'm asking for a function (or just a few lines of change) that depends on the rest of my code & environment, so it's not trivial to test.

serf · 36m ago

depends on the methodology really.

if you're doing TDD style work but with an AI it's not uncommon to one-shot a function and then throw it against your battery of tests.

it's also pretty doable if you're writing smallish scripts or trying to follow functional coding paradigms; with functional stuff it's often easy to pick apart the specific modules for testing against criteria.
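a rough sketch of that "one-shot it, then throw it at the tests" loop, assuming the generated code is a single pure function and the tests are plain (args, expected) pairs. `check_candidate` and the `median` example are made up for illustration, and `exec` here assumes you trust the code or are already inside a sandbox.

```python
from typing import Iterable


def check_candidate(
    source: str,
    func_name: str,
    cases: Iterable[tuple[tuple, object]],
) -> list[str]:
    """Return failure messages; an empty list means every case passed."""
    namespace: dict = {}
    exec(source, namespace)  # assumption: trusted or sandboxed code only
    func = namespace[func_name]
    failures = []
    for args, expected in cases:
        try:
            got = func(*args)
        except Exception as exc:
            failures.append(f"{func_name}{args} raised {exc!r}")
            continue
        if got != expected:
            failures.append(
                f"{func_name}{args} returned {got!r}, expected {expected!r}"
            )
    return failures


# Hypothetical generated function with a logic bug for even-length input.
GENERATED = """
def median(xs):
    xs = sorted(xs)
    return xs[len(xs) // 2]
"""

if __name__ == "__main__":
    cases = [(([1, 2, 3],), 2), (([1, 2, 3, 4],), 2.5)]
    for message in check_candidate(GENERATED, "median", cases):
        print(message)
```

this works because the function has no dependencies on the rest of a codebase, which is the situation tlb points out is often not the case.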

chasing0entropy · 5h ago

Sounds like an opportunity for you to make the world better by designing the process and implementing it.
