The main problem with regular (forward-only) debugging is that the state -- memory, CPU registers, cache, etc. -- that contributed to the bug is completely lost. With time travel debugging that state can be saved, which is great, but now you have a bunch of data you need to sift through as you trace the bug. AI seems like the right tool to save you that drudgery and get to the root cause sooner (or to work on it while you do other things in parallel).
This is new: something that wouldn't have been possible without both time travel debugging and the latest AI tech (MCP, code LLMs).
It would be interesting to know what challenges came up in nudging the model to work well with time travel debug data, since this kind of data is novel and today's models might not be well trained to make use of it.
mark_undoio · 4h ago
> It would be interesting to know what challenges came up in nudging the model to work well with time travel debug data, since this kind of data is novel and today's models might not be well trained to make use of it.
This is actually quite interesting - it's something I'm planning to make a future post about.
But basically the LLM seems to be fairly good at using this interface effectively, so long as we tune which tools we provide quite carefully:
* Where we would want the LLM to use a tool sparingly, it was better not to provide it at all. When you have time travel debugging it's usually better to work backwards, since that tells you the causality of the bug. If we gave Claude the ability to step forward it tended to use it for everything, even when it wasn't appropriate. (There's a rough sketch of this kind of curated tool surface after this list.)
* LLMs weren't great at managing state they'd set up. Allowing the LLM to set breakpoints just confused it later when it forgot they were there.
* Open-ended commands were a bad fit. For example, a time travel debugger can usually jump around in time according to an internal timebase. If the LLM was given unconstrained access to that, it tended to waste lots of effort guessing timebases and looking to see what was there.
* Sometimes the LLM just wants to hold something the wrong way and you have to let it. It was almost impossible to get the AI to understand that it could step back into a function on the previous line. It would always try going to the line, then stepping back, resulting in an overshoot. We had to just adapt the tool so that it could use it the way it thought it should work (the second sketch below shows one way that adaptation could look).
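To make that concrete, here's a rough, purely illustrative sketch of what a deliberately narrow tool surface like this could look like. The helpers (register_tool, run_udb_command) and the tool names are made up for the example and aren't our actual MCP server; the underlying commands (reverse-next, reverse-finish, watch, reverse-continue) are standard GDB-style reverse-execution commands.

```python
# Illustrative sketch only: a deliberately narrow tool surface for a
# time-travel debugger. register_tool / run_udb_command are made-up helpers
# for this example, not a real MCP SDK or an actual debugger integration.
from typing import Callable

TOOLS: dict[str, Callable[..., str]] = {}

def register_tool(name: str):
    """Expose a function to the LLM as a named tool."""
    def wrap(fn):
        TOOLS[name] = fn
        return fn
    return wrap

def run_udb_command(cmd: str) -> str:
    """Send a GDB-style command to the recorded process (stubbed for the sketch)."""
    print(f"(udb) {cmd}")
    return f"<output of `{cmd}`>"

# Backwards-only stepping: no forward-step tool is registered at all,
# so the model can't reach for it when it shouldn't.
@register_tool("step_back")
def step_back() -> str:
    """Go back one source line, stepping over function calls."""
    return run_udb_command("reverse-next")

@register_tool("reverse_finish")
def reverse_finish() -> str:
    """Run backwards to just before the current function was called."""
    return run_udb_command("reverse-finish")

# No breakpoint tools and no raw "jump to timebase" tool. Instead, a bounded
# question the model actually wants answered; any debugger state needed to
# answer it is created and cleaned up inside the one call, so the model never
# has to track it.
@register_tool("last_write_to")
def last_write_to(expression: str) -> str:
    """Run backwards to the most recent write to `expression`."""
    run_udb_command(f"watch -l {expression}")   # transient watchpoint
    out = run_udb_command("reverse-continue")   # hit it going backwards
    run_udb_command("delete")                   # server-side cleanup, not the LLM's job
    return out
```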
The overall result is actually quite satisfactory but it was a bit of a journey to understand how to give the LLM enough flexibility to generate insights without letting it get itself into trouble.
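And for the last bullet, one way the "let it hold the tool the way it wants" adaptation could look, reusing the made-up helpers from the sketch above: expose stepping back into the previous line's call as a single atomic tool, so the model never has to compose "go to the line" with "step back" itself.

```python
# Illustrative only, reusing register_tool / run_udb_command from the sketch above.
@register_tool("step_back_into_previous_call")
def step_back_into_previous_call() -> str:
    """Step backwards into the function called by the previous source line.

    One atomic operation, matching the model's mental model, so it never
    has to combine "go to line N-1" with "step back" (which is where it
    overshot).
    """
    # GDB-style reverse execution: reverse-step follows calls backwards;
    # exactly where it stops inside the callee depends on the debugger.
    return run_udb_command("reverse-step")
```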