We’ve been working on this problem off and on for over a year now. Models bake knowledge of some particular tools/libraries/patterns into their weights very well and of others quite poorly. In my experience Claude is quite good at integrating the dog.ceo API, noticeably ignorant when it comes to Postgres features, and knows gcloud commands just well enough to very confidently and consistently hallucinate arguments.
We’ve baked a solution to this into our product, so if anybody is working on an API/SDK/etc., feel free to contact me if your users are running into problems using LLMs to integrate them.
One thing we’ve noticed is that subtle changes to the context in library/API integration prompts can be surprisingly impactful. LLMs do very well with example commands and explicit instructions to consider X, Y, and Z. If you just dump an API reference plus information that only implicitly suggests X, Y, and Z might be beneficial, they won’t reliably make the logical leaps you want unless you let them iterate or “think” (spend more tokens) more. But you can’t easily provide an example for everything, and the examples you do provide will bias the model towards them, so you may need a bit of both.
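To make that concrete, here is a toy sketch of the two styles of context. Everything in it (the "acme" SDK and all of its names) is made up for illustration; it is not our product or any real API.

    # Toy illustration; "acme" and every identifier below are hypothetical.
    api_reference = open("docs/acme_api_reference.md").read()

    # Style 1: dump the reference and hope the model infers what matters.
    context_v1 = "Integrate the acme SDK into our service.\n\n" + api_reference

    # Style 2: the same reference, plus one worked example and explicit instructions.
    context_v2 = (
        "Integrate the acme SDK into our service.\n\n"
        "Example (list widgets, handling pagination):\n"
        "    client = acme.Client(api_key=KEY)\n"
        "    for page in client.widgets.list(page_size=100):\n"
        "        handle(page)\n\n"
        "Explicitly consider: pagination on every list endpoint, retry/backoff\n"
        "on 429 responses, and requesting the narrowest auth scope that works.\n\n"
        + api_reference
    )

The second style is roughly what we mean by "a bit of both": one concrete example plus explicit nudges layered on top of the reference.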
EGreg · 13m ago
I filed a provisional patent this year on exactly how I would solve this problem. Imagine hiring a "team of developers" who can learn your library and iterate 24/7, improving things, doing support, even letting the pointy-haired boss turn his ideas into reality in a forked sandbox on the weekend.
Normally I don't like software patents, but in the case of AI, I have made an exception. I have also rethought how I am going to do open source vs closed source in my AI business.
(If anyone wants to work with me on this, hit me up, email is in my profile) https://grokers.ai/patent.pdf
We’re trying to build a similar kind of experience but for both “sides” of the problem: software provider and software users/integrators.
rikroots · 51m ago
I've done a lot of work recently to make my library more "LLM friendly", but I'm not willing at this time to sign up for a service I don't know I'd ever use again just to run a test on your behalf. If you want to run the test on my library, its GitHub repo can be found here: https://github.com/KaliedaRik/Scrawl-canvas
richardblythman · 2h ago
If coding agents are the new entry point to your library, how sure are you that they’re using it well?
I asked this question to about 50 library maintainers and dev tool builders, and the majority didn't really know.
Existing code generation benchmarks focus mainly on self-contained code snippets and compare models, not agents. Almost none focus on library-specific generation.
So we built a simple app to test how well coding agents interact with libraries:
• Takes your library’s docs
• Automatically extracts usage examples
• Tasks AI agents (like Claude Code) with generating those examples from scratch
• Logs mistakes and analyzes performance
We’re testing libraries now, but it’s early days. If you're interested: Input your library, see what breaks, spot patterns, and share the results below.
We plan to expand to more coding agents, more library-specific tasks, and new metrics. Let us know what we should prioritize next.
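For anyone curious, here is a rough sketch in Python of the kind of loop described above. Every name in it is a placeholder rather than our actual implementation, and it assumes Claude Code's non-interactive "claude -p" mode for the agent call.

    # Rough, hypothetical sketch of the harness; not the real app.
    import json
    import re
    import subprocess

    def extract_examples(docs_text):
        # Naive extraction: pair each fenced code block in the docs with the
        # line of prose immediately before it, used as the task description.
        pairs = re.findall(r"(?s)([^\n]+)\n+```\w*\n(.*?)```", docs_text)
        return [(desc.strip(), code) for desc, code in pairs]

    def run_agent(task):
        # Claude Code in non-interactive ("print") mode; any agent CLI would do here.
        out = subprocess.run(["claude", "-p", task], capture_output=True, text=True)
        return out.stdout

    docs = open("docs.md", encoding="utf-8").read()
    for description, reference in extract_examples(docs):
        attempt = run_agent(
            "Using only the mylib library, write code that does the following: "
            + description
        )
        # Real scoring would execute the attempt and classify failure modes
        # (hallucinated methods, wrong imports, runtime errors); this just logs.
        print(json.dumps({"task": description, "reference": reference, "attempt": attempt}))

The interesting part, and presumably what the app adds on top of a loop like this, is the mistake logging and analysis rather than the extraction.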
bdhcuidbebe · 46m ago
> If coding agents are the new entry point to your library, how sure are you that they’re using it well?
> I asked this question to about 50 library maintainers and dev tool builders, and the majority didn't really know.
Why should they even bother to answer such a loaded and hypothetical question?
justonceokay · 57m ago
If making dev tooling is selling shovels to the miners, then this is like selling sheet metal to the shovel makers.
weitendorf · 16m ago
Let’s meet and see if it might make sense for us to team up. We’re working on this from the agent/library-specific-task side, and we might be better than chatgpt at marketing your product :)
dotancohen · 1h ago
Note that this comment is not hijacking. The author of this comment is also the author of the post.
add-sub-mul-div · 26m ago
That's the more likely assumption. Accounts with only self-promotion spam activity have become more the rule here than the exception.
spankalee · 43m ago
Why do we need to log in?
metadat · 58m ago
The skip-to-the-end answer: Context7 MCP is so good it seems like magic, even to many well-informed, highly capable hackers. Simply wildly good for libraries and SDKs. All it takes to start using it is to add the MCP provider to your agent config and tell the agent, "Use Context7 for this".
I'm a bit confused by this. For instance, Gemini was struggling to write proper Java code for using the Firebase Admin SDK. It would write Java code using methods that only exist in the JavaScript SDK, and when I corrected it, it would offer other options that were also only in the JavaScript SDK or were invalid.
https://context7.com/
So I thought this is where Context7 would be useful, but I'm confused by what I'm looking at on the detail page: https://context7.com/firebase/firebase-admin-java
I was expecting some sort of dump of all the Admin SDK methods, but it gives a single example of one library function and info on how to build the Javadoc.