Ask HN: Building LLM apps? How are you handling user context?

26 points by marcospassos | 18 comments | 5/26/2025, 2:23:27 PM
I've been building stuff with LLMs, and every time I need user context, I end up manually wiring up a context pipeline.

Sure, the model can reason and answer questions well, but it has zero idea who the user is, where they came from, or what they've been doing in the app. Without that, I either have to make the model ask awkward initial questions to figure it out or let it guess, which is usually wrong.

So I keep rebuilding the same setup: tracking events, enriching sessions, summarizing behavior, and injecting that into prompts.

It makes the app way more helpful, but it's a pain.

What I wish existed is a simple way to grab a session summary or user context I could just drop into a prompt. Something like:

```
const context = await getContext();

const response = await generateText({
  system: `Here's the user context: ${context}`,
  messages: [...],
});
```

Some examples of how I use this:

- For support, I pass in the docs they viewed or the error page they landed on.

- For marketing, I summarize their journey, like 'ad clicked' → 'blog post read' → 'pricing page'.

- For sales, I highlight behavior that suggests whether they're a startup or an enterprise.

- For product, I classify the session as 'confused', 'exploring plans', or 'ready to buy'.

- For recommendations, I generate embeddings from recent activity and use that to match content or products more accurately.

In all of these cases, I usually inject things like recent activity, timezone, currency, traffic source, and any signals I can gather that help guide the experience.
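As a rough illustration of that injection step, here's a minimal sketch. All names (`UserSignals`, `renderContext`) are hypothetical, not an existing API; the point is just to render only the signals you actually have into a prompt-ready block:

```typescript
// Hypothetical shape for the signals gathered per session.
interface UserSignals {
  trafficSource?: string;   // e.g. "google-ads"
  timezone?: string;        // e.g. "America/Sao_Paulo"
  currency?: string;        // e.g. "USD"
  recentEvents: string[];   // ordered click path
}

// Render only the signals that are present, so the model
// isn't fed empty fields it might latch onto.
function renderContext(signals: UserSignals): string {
  const lines: string[] = [];
  if (signals.trafficSource) lines.push(`Traffic source: ${signals.trafficSource}`);
  if (signals.timezone) lines.push(`Timezone: ${signals.timezone}`);
  if (signals.currency) lines.push(`Currency: ${signals.currency}`);
  if (signals.recentEvents.length > 0) {
    lines.push(`Journey: ${signals.recentEvents.join(" → ")}`);
  }
  return lines.join("\n");
}

const context = renderContext({
  trafficSource: "google-ads",
  currency: "USD",
  recentEvents: ["ad clicked", "blog post read", "pricing page"],
});
// `context` is then dropped into the system prompt, e.g.
// `Here's the user context:\n${context}`
```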

Has anyone else run into this same issue? Found a better way?

I'm considering building something around this initially to solve my problem. I'd love to hear how others are handling it or if this sounds useful to you.

Comments (18)

matt_s · 16h ago
Interacting with LLMs or AI APIs follows the same patterns as other software. It doesn't really matter that it's AI or an LLM: you're calling a function, providing inputs, and expecting output. You get better output when your inputs are tuned to the scenario. Some of your inputs in this paradigm could be considered optional parameters, because you still get output without them.

If you need to remember parts of the inputs between user sessions, then you need to save that state somewhere on disk. Databases are a common choice, especially in web development, but you could also just put things in a file. If this isn't a web development context, another option is something like SQLite, which will organize the data a little better than, say, CSVs.
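A minimal sketch of the "just put things in a file" option, persisting session context as JSON between runs (the `SessionState` shape and file path are invented for illustration; a database or SQLite works the same way):

```typescript
// Persist inter-session context to a flat JSON file on disk.
import { readFileSync, writeFileSync, existsSync } from "node:fs";

interface SessionState {
  userId: string;
  events: string[];
}

function loadState(path: string, userId: string): SessionState {
  // First visit: no file yet, start with an empty event history.
  if (!existsSync(path)) return { userId, events: [] };
  return JSON.parse(readFileSync(path, "utf8")) as SessionState;
}

function saveState(path: string, state: SessionState): void {
  writeFileSync(path, JSON.stringify(state, null, 2));
}

// Usage: load, append this session's events, save for next time.
const state = loadState("/tmp/session-demo.json", "user-123");
state.events.push("pricing page viewed");
saveState("/tmp/session-demo.json", state);
```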

coolKid721 · 16h ago
Proper usage of LLMs, so you don't just flood them with useless context, comes down to custom-tailored prompts that include only the pertinent context, with the prompt saying how it relates to what you're looking for. I don't think there's a cheap way around it; on the plus side, you can tune the prompts using AI-written code. I think tools are really overused and overrated, and I've had horrible experiences with them. Nothing beats custom-tailoring stuff and setting up a system around it.

What I do is use Elixir Phoenix: a GenServer keeps track of the user state, I include the related state in the request, and helper functions generate the related prompts per type of state/context and append them wherever makes the most sense.

I think LLMs make the most sense viewed as singular atomic interactions where you have the whole input (prompt/context/data) and get a concrete output. Everything else just seems like people being lazy and avoiding thinking about the best way of structuring it. Where you put the context/data and how you include it will vary per prompt, per specific atomic interaction; there is no standard rule, and each interaction is unique. You have to experiment and see what provides the best output for each kind of request. I'd read Anthropic's prompting docs if you haven't; they're very good. https://docs.anthropic.com/en/docs/build-with-claude/prompt-...

My way of thinking is to view every isolated LLM request as a unique function: prompt + LLM = a unique function. Context is just the data you pass into that function, (prompt + LLM + settings (temp, etc.))(data), to get whatever specific output you want. The prompt includes pre-written user/system messages, the system prompt, structured-output settings, and so on. Any single request might lead to 1 or 30 of these feeding back into each other. Based on that, it comes down to custom-tailoring them for each case. It's pretty conceptual and intellectual, and I find it fun, but I don't think there's any easy way around it. Having all your requests be stateful and modifying what goes into the prompt based on the current user state (which GenServers/Elixir make very easy) is a nice technical thing that helps, though.
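That "(prompt + LLM + settings)(data)" framing can be sketched directly. Everything here is hypothetical (`makeLlmFn`, the fake client); the idea is just that the template, model, and settings are fixed once, and only the data varies per call:

```typescript
interface LlmSettings { model: string; temperature: number; }

type LlmFn = (data: string) => Promise<string>;

// Close over the template and settings so each "function" is fixed;
// only the data varies per call.
function makeLlmFn(
  template: (data: string) => string,
  settings: LlmSettings,
  callModel: (prompt: string, settings: LlmSettings) => Promise<string>,
): LlmFn {
  return (data) => callModel(template(data), settings);
}

// Example: a session classifier built once, reused per request.
// The client below is a fake stand-in for the sketch, not a real API.
const classifySession = makeLlmFn(
  (events) => `Classify this session as confused/exploring/ready:\n${events}`,
  { model: "some-model", temperature: 0 },
  async (prompt) => (prompt.length > 80 ? "exploring" : "confused"),
);
```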

ProfessorZoom · 16h ago
I embed tons of separate pieces of information and save the vectors in a DB. I embed the user's question, then a stored procedure in the DB calculates the top 10 (or 20 or 50, depending on the model) most similar pieces of information.

I have an editor where I can ask a question and it brings up the most related pieces of info, and if I change any of those pieces, it updates the embedding in the DB.
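The ranking step of this approach, minus the database, is just cosine similarity over stored vectors. A minimal sketch (in practice the vectors come from an embedding model and the ranking runs in the DB, e.g. as a stored procedure):

```typescript
// Cosine similarity between two embedding vectors of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

interface Doc { text: string; vector: number[]; }

// Return the k documents most similar to the query vector.
function topK(query: number[], docs: Doc[], k: number): Doc[] {
  return [...docs]
    .sort((x, y) => cosine(query, y.vector) - cosine(query, x.vector))
    .slice(0, k);
}
```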

marcospassos · 16h ago
That's a good approach. But what I'm looking for is a bit different, more like Segment, but for LLMs. Something where, when a user lands on your website, clicks around, and interacts with your app, you get a full behavioral context out of the box, including click path, location, language, currency, etc. You can then inject that context directly into your prompt so the LLM understands what the user is doing and responds without guessing or asking.
enos_feedler · 16h ago
What is the application specific scenario that is requiring this context? Everyone has different scenarios and this might not make sense
max_on_hn · 16h ago
I don't know of anything off-the-shelf, but you could query analytics tools at runtime (e.g. Mixpanel, PostHog) to gather the raw data, and use a generic summarizer to turn that into behavioral context that's usable downstream.
marcospassos · 16h ago
Yeah, exactly. My whole point is to avoid doing all that. It adds up fast. What I really want is something that handles the heavy lifting end-to-end: tracking, interpreting, and outputting a prompt-ready summary like:

"The user landed on the pricing page from a Google ad, clicked to compare plans, then visited the enterprise section before initiating a support chat."

rcarmo · 2h ago
That reads like the kind of session context you’d use for things like breadcrumbs and the like. Just keep a summary going in the user session, re-pack it or summarize it as soon as it gets above a threshold.
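A minimal sketch of that rolling-summary idea (the threshold and the re-packing rule are arbitrary choices for illustration; in practice the collapse step would call an LLM summarizer rather than string-join):

```typescript
// Keep at most this many summary entries before re-packing.
const MAX_ITEMS = 5;

// Append an event; once past the threshold, collapse the older
// entries into one line and keep the recent tail verbatim.
function appendAndRepack(summary: string[], event: string): string[] {
  const next = [...summary, event];
  if (next.length <= MAX_ITEMS) return next;
  const head = `earlier: ${next.slice(0, -3).join(", ")}`;
  return [head, ...next.slice(-3)];
}
```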
bilater · 15h ago
You might find this useful: https://context7.com/
marcospassos · 15h ago
Super interesting! However, it focuses on external sources rather than the user journey.
esafak · 12h ago
I think MCP is the right place to declare the context management API; the C in MCP is Context. As far as building goes, you could build a (universal) context store. I guess the value would be to bring the context closer to the model?
marcospassos · 12h ago
The value is building the context itself.

Using MCP, this could be exposed as a method the model calls to fetch the context it needs to make decisions.

Here's an example of how I use it currently:

```
const context = await getContext();

const response = await generateText({
  system: `Here's the user context: ${context}`,
  messages: [...],
});

console.log(context);
// "First-time visitor using Google Chrome on a MacBook, browsing from San Francisco.
// Landed on the pricing page from a Google ad, clicked to compare plans,
// then visited the enterprise section before initiating a support chat."
```

It's like a session recorder for LLMs that captures rich user behavior and traits (like device, browser, location, and journey) and turns them into LLM context. Your agent or app instantly becomes more helpful, relevant, and aware without wiring up your own tracking and enrichment pipeline.

esafak · 12h ago
A context inference service sounds valuable but I wonder what your moat would be.
marcospassos · 12h ago
Yep, that's something I'd have to figure out.
nico · 14h ago
I haven’t solved this, but sounds super useful!

Would love to have something like a hotjar/analytics script that could automatically collect context and then I could query it to produce context for a prompt

Great idea, you should build it. Then do a Show HN with it

marcospassos · 14h ago
Exactly! Something like a tag you install and then query prompt-ready contexts.
barbazoo · 17h ago
MCP maybe? You could provide tools for the LLM to discover that data at runtime.
marcospassos · 16h ago
It might help with context generation. But honestly, most of the work is still in tracking, processing, enriching (calling out to different services for IP geolocation, etc.), and all the plumbing around it.