Show HN: Index – New Open Source browser agent

98 skull8888888 45 4/23/2025, 4:11:11 PM github.com ↗

Hey HN, Robert from Laminar (lmnr.ai) here.

We built Index, a new SOTA open-source browser agent.

It reached 92% on WebVoyager with Claude 3.7 (extended thinking). o1 was used as the judge, and we manually double-checked its verdicts.

At the core is the same old idea: run a simple JS script in the browser to identify interactable elements -> draw bounding boxes around them on a screenshot of the browser window -> feed it to the LLM.
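A rough sketch of the middle step of that pipeline, in plain Python so it runs without a browser. It assumes the in-page script has already returned element rectangles; the `Element` class, `INTERACTIVE_TAGS`, `label_interactive` and `describe` are hypothetical names for illustration, not Index's actual detection code (which, per the post, also uses CV and OCR).

```python
from dataclasses import dataclass


@dataclass
class Element:
    """One DOM element as an in-page script might report it (hypothetical shape)."""
    tag: str
    x: float
    y: float
    width: float
    height: float
    visible: bool = True


# Assumption: a tag allowlist is enough for the sketch; a real detector checks far more.
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


def label_interactive(elements, viewport_w, viewport_h):
    """Return (index, element) pairs for elements worth boxing on the screenshot."""
    labelled = []
    for el in elements:
        if el.tag not in INTERACTIVE_TAGS or not el.visible:
            continue
        if el.width <= 0 or el.height <= 0:
            continue
        # Skip boxes that fall entirely outside the screenshot.
        off_right_or_bottom = el.x >= viewport_w or el.y >= viewport_h
        off_left_or_top = el.x + el.width <= 0 or el.y + el.height <= 0
        if off_right_or_bottom or off_left_or_top:
            continue
        labelled.append((len(labelled), el))
    return labelled


def describe(labelled):
    """Text companion to the annotated screenshot, one line per bounding box."""
    return "\n".join(
        f"[{idx}] <{el.tag}> at ({el.x:.0f}, {el.y:.0f}) size {el.width:.0f}x{el.height:.0f}"
        for idx, el in labelled
    )
```

The indices are what the model would refer to when it picks an element to click or type into; drawing the boxes themselves on the screenshot is left out here.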

What made Index so good:

1. We essentially created browser agent observability. We patched Playwright to record the entire browser session while the agent operates, simultaneously tracing all agent steps and LLM calls. Then we synchronized everything in the UI, creating an unparalleled debugging experience. This allowed us to pinpoint exactly where the agent fails by seeing what it "sees" in session replay alongside execution traces.

2. Our detection script is simple but extremely good. It's carefully crafted via trial and error. We also employed CV and OCR.

3. The agent is very simple, literally just a while loop. All the power comes from a carefully crafted prompt and a ton of eval runs.

Index is a simple Python package. It also comes with a beautiful CLI.
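To make "just a while loop" concrete, here is a minimal sketch of such a loop. It is not Index's code: `run_agent`, the `observe`/`act`/`llm` callables and the action dictionary shape are all assumptions chosen for illustration, with the browser and the model passed in as plain functions.

```python
def run_agent(task, llm, observe, act, max_steps=20):
    """Observe -> ask the model -> act, until the model says it is done.

    llm(task, observation, history) must return a dict with a "type" key;
    {"type": "done", "result": ...} ends the loop (hypothetical convention).
    """
    history = []
    steps = 0
    while steps < max_steps:
        observation = observe()          # e.g. annotated screenshot + element list
        action = llm(task, observation, history)
        history.append(action)
        if action["type"] == "done":
            return action.get("result"), history
        act(action)                      # e.g. click, type, scroll in the browser
        steps += 1
    # Step budget exhausted without the model declaring completion.
    return None, history
```

The step budget is the only safeguard in this sketch; retries, error handling and prompt construction would all live inside the callables.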

pip install lmnr-index

playwright install chromium

index run

We've recently added o4-mini, Gemini 2.5 Pro and Flash. Pro is extremely good and fast. Give it a try via CLI.

You can also use Index via the serverless API (https://docs.lmnr.ai/index-agent/api/getting-started)

Or via the chat UI - https://lmnr.ai/chat.

To learn more about browser agent observability and evals, check out our open-source repo (https://github.com/lmnr-ai/lmnr) and our docs (https://docs.lmnr.ai/tracing/browser-agent-observability).

Comments (45)

androng · 76d ago

Can it actually do something difficult like apply for jobs? So far I know of five or so websites that claim they can apply to jobs for you like sonara.ai and usemassive.com and Skyvern AI but when you try to actually use them all they can do is the one-page job applications and not the much more common Workday 10-page job applications with annoying "create an account" and annoying questions like "Do you have any relatives that work at Sony" and annoying "fill out all your work experience" where you have to click 50 times for one application. That's like half of all job applications. https://jobs.spectrum.com/job/-/-/4673/76746020384?utm_sourc...

skull8888888 · 76d ago

I'm pretty confident it can do it. Try it out and see for yourself. Just install the package, run the CLI and give it your prompt.

pip install lmnr-index

playwright install chromium

index run

Also try experimenting with different models. So far, Gemini 2.5 Pro is the best in terms of quality/speed. Claude 3.7 is also pretty good.

naim08 · 76d ago

shit, let me try it out

rushils · 76d ago

while we don't auto-apply to jobs for you, our browser extension, Simplify Copilot, makes it easier to apply to those multi-step application forms (workday, taleo, sap, etc.)

https://simplify.jobs/install

skull8888888 · 76d ago

any need for browser agent observability?

esafak · 76d ago

I consider using bad hiring software like that a red flag, and suggestive of other things the company must be doing wrong too. I noped out whenever I saw Taleo.

globalnode · 76d ago

All big successful companies do "something" wrong, that's how they make money. Steal your OSS, not pay taxes, avoid overtime payments, low wages, outsource slavery, destroy the environment, gaslight while they steal your data and subject millions to dark patterns of advertising and marketing, screw over suppliers, intentionally sow discord as a distraction. The list goes on. To me the bigger the company the bigger the red flag.

tomdekan · 76d ago

That’s a very cynical view.

Do the biggest companies not create the most value for the world?

Consider this. If the most successful companies are simply cheating customers, then most consumers are stupid, handing over their hard-earned money for bad deals and to be exploited.

But most people are not stupid, and most people highly value their money. So, they only buy something because they want what the seller is offering even more than their money. This means that companies create great value because they offer something that people really want.

noooooooph · 75d ago

This assumes there are compelling alternatives in the market that I can choose from. In reality, there are only a few entrenched players in any established market that work hard to limit competition. So yes, even if I'd like to choose not to hand over my hard-earned money to Evil Corp #93, I can only do the "stupid" thing and watch myself and my environment get exploited.

tomdekan · 75d ago

Also, I don’t think my previous comment does assume that there are compelling alternatives.

A person always has a choice not to spend their money. Even if they need expensive healthcare, they can choose not to buy it. By buying the product, they want the service more than their money.

They might think that the price is too high, but prices are a function of market forces.

It doesn’t make sense to me that a person can say they feel exploited because they have voluntarily chosen to buy at a particular price. They probably want to pay less, and might feel that the consumer surplus is low, but they still value the service more than their money. That isn’t exploitation to me.

tomdekan · 75d ago

I disagree with the general comment that “any established market” has “only a few entrenched players”. I’d say that most markets provide compelling alternatives. Where they don’t yet, the product is either a commodity, or there is an opportunity for a new business to serve the customer!

But let’s say your point is true. How do those players become entrenched? I’d say it’s from providing great value.

mulmboy · 76d ago

Nice.

Can run with `uvx --from lmnr-index --python 3.12 index run`

hackerknew · 76d ago

How well does it work with bank websites with non-conventional multi-click logins that sometimes include an "important message" that you have to click through just to get your balance?

skull8888888 · 75d ago

works pretty well, try it out - pip install lmnr-index

methyl · 75d ago

How do you force gemini or any other model to actually login to the bank system? Anything I try, I end up with "I can't do that as it's sensitive"

skull8888888 · 75d ago

have you tried it with Index?

noleary · 76d ago

> Index is the SOTA open-source browser agent for autonomously executing complex tasks on the web.

I've written a handful of pretty hacky Python scripts that just pull down all of the HTML content from a page and toss it over to OpenAI. As you can imagine, these were all extremely simple tasks, e.g., "find out if there's a login button"

What's a good example of a complex task that Index is well-suited for? What's the threshold of minimal complexity where you guys are a really good fit?

skull8888888 · 76d ago

- research task, agent is smart enough to understand which links to click next without the need to hardcode the parsing and navigation logic

- any task that requires UI interaction, button clicking, filter selection, form filling and so on. Just prompt it, it's surprisingly very robust and self-healing.

- complex long-running tasks that require extensive context - e.g. researching one topic and then creating a spreadsheet, creating a presentation for a topic and so on.

Essentially, any task that can be done within a browser environment that previously required flaky, hardcoded, predefined scripts. Also, website testing is a great example.

nico · 76d ago

Would love to see it doing some work on a Google spreadsheet (including doing formulas, vlookups, data import and cleanup) and then creating a decent Slides presentation with some charts from the spreadsheet

skull8888888 · 75d ago

it can do it! try it out, literally just prompt it

shekhar101 · 76d ago

Can you open up the options to use other models/versions, especially the Gemini-2.5 pro experimental models available through aistudio? Would love to try this but gemini flash fails for even simple tasks. Example: I asked it to extract all the links from the comment section of a Hacker News thread and it just scrolled all the way to the end and then did nothing. Maybe pro models can do it better.

Yiling-J · 76d ago

"Gemini Flash fails even for simple tasks." On the Gemini Flash page (https://deepmind.google/technologies/gemini/flash/), it claims to be 'best for fast performance on complex tasks.'. I always use Gemini Flash in my project for demos and testing, and it performs very well, if a project requires a large, expensive model to handle simple tasks, that could be an issue to users.

skull8888888 · 76d ago

Gemini 2.5 pro is available. Is it missing on your side? Do you run index via CLI?

shekhar101 · 76d ago

Yes it is, however API keys from aistudio only allow the pro-experimental model. So if I select gemini-pro, I will see this: "Gemini 2.5 Pro Preview doesn't have a free quota tier. Please use Gemini 2.5 Pro Experimental (models/gemini-2.5-pro-exp-03-25) instead". Can I choose the exact model somewhere in the CLI?

skull8888888 · 76d ago

Oh I see, didn't know about that, fastest and easiest thing you can do is to play around with pro via our chat UI https://lmnr.ai/chat - it's free up to 10 messages.

For the CLI and custom models, you can clone the repo, then go to the cli.py and manually add your model there. I will work on proper support of custom models.

naim08 · 76d ago

extremely slow

skull8888888 · 76d ago

which model are you using? try gemini pro/flash, they are very fast

jrvarela56 · 76d ago

My first reaction was to look for MCP server so that I could connect it to Cursor. Just pointing this out in case it helps with new user onboarding. MCP server would work to hook it up to the Claude Desktop Website and most agentic-IDEs (Cursor, Cline, Roo, Windsurf, etc).

skull8888888 · 75d ago

thank you for the feedback! we're actually working on it :)

badmonster · 76d ago

What’s the most surprising or complex real-world task you’ve seen it succeed at so far?

skull8888888 · 75d ago

researching a topic and creating a spreadsheet

simba-k · 75d ago

I feel like I see a new company doing this every week. I know there is Skyvern and browser-use in particular. Is there something special about this one?

skull8888888 · 75d ago

The best out of all of them. Check the first message in the post. tldr:

- SOTA on webvoyager

- browser agent observability

- fast and reliable

- CLI for easier interaction

- available as a serverless API

lostmsu · 75d ago

Can I switch it to use my own models? Would it work with Gemma 3? Is vision required (Gemma 3 has it, but unsure if it supports coordinates)?

skull8888888 · 75d ago

right now the package only supports models from gemini, anthropic and openai. Vision is required. PRs are very welcome! It's very easy to add a new model, simply follow any of the providers here https://github.com/lmnr-ai/index/tree/main/index/llm/provide...

omerhefets · 72d ago

How do you perform actions with this agent? Puppeteer / playwright session?

skull8888888 · 72d ago

playwright

xena · 76d ago

How do I block it from my services? Does it obey robots.txt?

lolinder · 76d ago

This is a user agent, not a recursive web crawler, so robots.txt explicitly does not apply [0]. As long as I'm not abusing your services you shouldn't care if my instructions to my user agent come in the form of a click of a button, a keyboard shortcut, a script, or an LLM agent.

If it's abusive behavior you are worried about you should be able to detect and block it with rate limits or other tools that target the malicious behavior. If you can't distinguish between my usage and a regular browser then I'm not sure what moral ground you have to claim my usage is hurting you.

[0] "A robot is a program that automatically traverses the Web's hypertext structure by retrieving a document, and recursively retrieving all documents that are referenced." https://www.robotstxt.org/faq/what.html
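For the "rate limits" option mentioned above, a site operator could start from something like the sketch below: a per-client sliding-window limiter using only the standard library. The class name, the parameters and the idea of keying on a client ID are assumptions for illustration; in practice this usually lives in a reverse proxy or API gateway, not in application code.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most max_requests per client within any window_seconds span."""

    def __init__(self, max_requests, window_seconds, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock                  # injectable so tests can control time
        self._hits = defaultdict(deque)     # client_id -> timestamps of accepted requests

    def allow(self, client_id):
        now = self.clock()
        hits = self._hits[client_id]
        # Forget requests that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True
```

This targets behavior (request volume) and treats a human, a script and an LLM-driven browser the same way, which is the point the comment makes.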

CaptainFever · 76d ago

"That's the neat part, you don't."

If I wanted to use this to do my personal browsing for me, like checking for website updates on those where RSS does not exist, you shouldn't be able to stop me.

keyle · 76d ago

Impressive and potentially very interesting future work.

One thing I couldn't help but notice was the crazy amount of HTTP requests going on in the demo on the github readme page, and the video looks to be sped up.

I'm all for AI assisting but I wouldn't want to create even 1/10th of these HTTP requests, as a good netizen; unless I'm missing the point.

skull8888888 · 75d ago

how could you see an http request from a video? If you mean the console output, then it's just logs of an agent.

purplecats · 76d ago

got a video demo that isn't behind a sign up wall?

skull8888888 · 76d ago

here's a demo of a chat UI https://x.com/skull8888888888/status/1910763169489764374

here's a demo of CLI https://x.com/skull8888888888/status/1914728292193628330

skull8888888 · 76d ago

there's also a demo right in the repo https://github.com/lmnr-ai/index
