Show HN: I made web agents reliable with smaller LLMs via natural language
I built Notte to see if converting the DOM into natural language could improve web agent capabilities and make agents work reliably with smaller models.
The approach combines deep DOM parsing with a semantic abstraction layer that transforms websites into structured, navigable maps described in natural language. Instead of feeding the LLM raw HTML, a perception layer describes each element's purpose, so the model doesn't just click DOM elements, it understands the intent behind them.
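To make the idea concrete, here is a minimal, hypothetical sketch of such a perception layer (this is not Notte's actual implementation, just an illustration): parse the HTML, keep only interactive elements, and describe each one in plain English so a smaller LLM can reason over a short action list instead of raw markup.

```python
# Hypothetical DOM -> natural-language "perception layer" sketch.
# Not Notte's real code: it only illustrates the idea of replacing
# raw HTML with plain-English descriptions of available actions.
from html.parser import HTMLParser


class PerceptionLayer(HTMLParser):
    INTERACTIVE = {"a", "button", "select", "textarea"}

    def __init__(self):
        super().__init__()
        self.actions = []     # natural-language descriptions of actions
        self._current = None  # (tag, attrs, text chunks) being collected

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "input":
            # <input> is a void element, so describe it immediately
            self.actions.append(f'Text field: "{attrs.get("placeholder", "")}"')
        elif tag in self.INTERACTIVE:
            self._current = (tag, attrs, [])

    def handle_data(self, data):
        if self._current:
            self._current[2].append(data.strip())

    def handle_endtag(self, tag):
        if self._current and tag == self._current[0]:
            tag, attrs, text = self._current
            label = " ".join(t for t in text if t) or attrs.get("aria-label", "")
            if tag == "a":
                self.actions.append(f'Link: "{label}" -> {attrs.get("href", "?")}')
            else:
                self.actions.append(f'{tag.capitalize()}: "{label}"')
            self._current = None


html = ('<nav><a href="/login">Sign in</a></nav>'
        '<input placeholder="Search products">'
        '<button>Checkout</button>')
layer = PerceptionLayer()
layer.feed(html)
print(layer.actions)
# -> ['Link: "Sign in" -> /login', 'Text field: "Search products"',
#     'Button: "Checkout"']
```

A real system would go much further (roles, visibility, nesting, dynamic content), but even this toy version shows why the prompt gets smaller and more intent-oriented than raw HTML.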
I benchmarked it against other agent frameworks and was pleasantly surprised - faster task completion and increased reliability (all open source with replayable/reproducible code).
Beyond the core tech, I also built out unified session management, stealth features, a credentials vault, human-in-the-loop CAPTCHA solving + some more cool features, all via a single API. Still working out some edge cases with dynamic content, but it's been solid for most real-world tasks.
Github: https://github.com/nottelabs/notte
Benchmarks: https://github.com/nottelabs/open-operator-evals
Docs: https://docs.notte.cc/side/introduction/what-is-notte
Some questions for the HN community: What's your biggest pain point with current web automation? What would you need to see before you'd try it, or trust it on your workflows? Anyone else tried preprocessing web content for LLMs? Curious what approaches have worked for you.
Thanks for checking it out.
— Lucas