Ask HN: Is there a business for extracting US tech talent?

18 points by Arubis 6h ago 12 comments

Ask HN: What are good questions to ask in a remote round in post GPT era?

3 points by ashu1461 4h ago 7 comments

Ask HN: How to make money with SaaS without network or VC funding?

4 points by squareloop 4h ago 3 comments

Ask HN: Freelancer? Seeking freelancer? (July 2025)

83 points by whoishiring 2d ago 186 comments

Ask HN: Who is hiring? (July 2025)

266 points by whoishiring 2d ago 362 comments

Ask HN: What Are You Working On? (June 2025)

430 points by david927 4d ago 1359 comments

Super Simple "Hallucination Traps" to detect interview cheaters

26 points by EliotHerbst 1d ago 32 comments

Ask HN: Who wants to be hired? (July 2025)

128 points by whoishiring 2d ago 339 comments

Ask HN: What are the best resources to help with health insurance denials?

5 points by cigna 10h ago 7 comments

Ask HN: How to create a more human-centric web?

4 points by saubeidl 10h ago 3 comments

Ask HN: I give in, what are the resources for picking up AI-assisted coding?

2 points by dhosek 11h ago 2 comments

Ask HN: What's the 2025 stack for a self-hosted photo library with local AI?

222 points by jamesxv7 3d ago 118 comments

Ask HN: How do I prevent execs from obsessing over copy-protection?

5 points by bad_boomerang 13h ago 7 comments

1KB JavaScript Demoscene Challenge Just Launched

114 points by babakode 2d ago 31 comments

Ask HN: Why there is no demand for my SaaS when competition is killing it?

32 points by drvroom 1d ago 31 comments

Ask HN: Ideas to acquire "good taste" in programming?

4 points by danielciocirlan 20h ago 8 comments

Ask HN: Are AI Copilots Eroding Our Programming Skills?

6 points by buscoideais 1d ago 12 comments

Tell HN: Google says "not vuln", fixes hours later without attribution

17 points by Eikon 14h ago 3 comments

Ask HN: How to Block Spam Mails?

5 points by mpaepper 1d ago 8 comments

Ask HN: 80s electronics book club; anyone remember this illustrator?

34 points by codpiece 6d ago 23 comments

Ask HN: How do I open up my side project to the world?

9 points by picolas 2d ago 12 comments

Ask HN: Should I use microservices or monolithic architecture?

3 points by its_kritix 8h ago 2 comments

Ask HN: Why privacy consent is NOT part of Browser setting?

2 points by the_arun 1d ago 5 comments

Ask HN: How have you shared computers with your young child (~3 to 5)

18 points by msencenb 3d ago 14 comments

Ask HN: How did low contrast text become so pervasive?

22 points by mr-pink 4d ago 26 comments

Ask HN: Startup shutting down, should we open source?

14 points by amadeoeoeo 6d ago 37 comments

Ask HN: Would limiting game size to 5–10 MB spur the creation of novel games?

4 points by amichail 1d ago 4 comments

LinkedIn Locked Me Out Until I Submit to Biometric ID Verification via Persona

10 points by AllanSavageDev 2d ago 5 comments

Ask HN: Anyone is an "AI Engineer"? What does your job tasks include?

12 points by akudha 2d ago 8 comments

Ask HN: Stock Android tablet free of bloatware?

12 points by miki_tyler 3d ago 6 comments

Why Are SaaS Boilerplates Still This Expensive? So I Built My Own

3 points by Shreyan19 22h ago 0 comments

Ask HN: Which Free Software or Open Source Project Needs Help?

15 points by em-bee 4d ago 6 comments

Ask HN: Is noprocrast still working for you?

10 points by infotainment 4d ago 6 comments

It is not possible to install your own addon in Firefox without Moz's approval

7 points by julkali 2d ago 6 comments

Tell HN: (dictionary|thesaurus).reference.com is now a spam site

51 points by akkartik 4d ago 14 comments

Ask HN: Who's using AI to build non-AI products?

4 points by leonagano 2d ago 5 comments

Harsh Working Environment in Japan

15 points by wakuwakustudio 2d ago 23 comments

Ask HN: Better-auth or Nextauth or something else

8 points by dasubhajit 5d ago 0 comments

Context Engineering for the LLM OS: User vs. Kernel Context

2 pacjam 3 7/3/2025, 6:50:17 PM letta.com ↗

Comments (3)

alganet · 11h ago

I like the idea of memory management. Maybe someone experienced in this stuff can help me with some questions:

Is it possible to use this concept to keep a very long session? Tell it to forget things, or replace some part of the memory, without "rebooting" (starting another instance or conversation)?

So far, I'm unable to find any information on how to do that. In the OS analogy, the stuff I found looks more like putting stuff on autoexec.bat to open on the next boot than proper management of memory during execution.

It looks more like "autoexec.bat engineering". Is that the same overall idea?

That time to first token is really expensive, like a reboot. Any tool to reduce it and keep the model running for longer would be a real breakthrough, but I haven't found any practical examples of it, just stuff that does "autoexec.bat" analogues.

wooders · 10h ago

I think the "memory blocks" are essentially what you are describing - to have an infinite session (which systems like Letta is designed for) you have to have a mechanism for organizing the important information and persisting it for future interactions. This organization can be done via tool calls (which was what MemGPT did) or done by other agents in the background. While the message buffer is continues to grow / old messages get evicted, the memory blocks are fixed size and always in context.

alganet · 10h ago

Your answer is too vague for the details I asked.

I could design an autoexec.bat to remember the programs that were opened after reboot, all automatically. If I open something, it goes there. If I close, I remove it from autoexec.bat. MacOS does this. But that's not really the persistence that saves me time and money. MacOS is good because _I rarely need to reboot it_, and the "reopen windows after reboot" option is barely used.

There's one question I placed there that perfectly encapsulates my doubts:

_Can I use this "context engineering" to mitigate the costs of the time for first token?_

If I cannot, then it's just like rebooting an OS, and it is merely the illusion of persistance. I can totally do this on my own just like I can craft hacky autoexec.bat scripts, nothing special about it.

I've seen attempts at doing "snapshotting" of parts of a GPU memory, which are similar to pausing a VM after boot and then restoring it. That's also not what I'm talking about, and it is just an optimization on the process of rebooting and does not improve much on the time for first token (there's a time penalty either way).