A valid HTML zip bomb

137 points by Bogdanp | 36 comments | 7/24/2025, 1:16:29 PM | ache.one ↗

Comments (36)

bhaney · 1d ago
Neat approach. I make my anti-crawler HTML zip bombs like this:

    (echo '<html><head></head><body>' && yes "<div>") | dd bs=1M count=10240 iflag=fullblock | gzip > bomb.html.gz
So they're just billions of nested div tags. Compresses just as well as repeated-single-character bombs in my experience.
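If you want to sanity-check the ratio yourself (numbers here are ballpark, not measured):

    # ~10 GiB of repeated "<div>\n" typically gzips down to roughly 10 MB,
    # i.e. around 1000:1, which is close to gzip's ceiling on repetitive input
    ls -lh bomb.html.gz
    zcat bomb.html.gz | wc -c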
pyman · 1d ago
This is a great idea.

LLM crawlers are ignoring robots.txt, breaching site terms of service, and ingesting copyrighted data for training without a licence.

We need more ideas like this!

bhaney · 1d ago
This is the same idea as in the article, just an alternative flavor of generating the zip bomb.

And I actually only serve this to exploit scanners, not LLM crawlers.

I've run a lot of websites for a long time, and I've never seen a legitimate LLM crawler ignore robots.txt. I've seen reports of that, but any time I've had a chance to look into it, it's been one of:

- The site's robots.txt didn't actually say what the author thought they had made it say

- The crawler had nothing to do with the crawler it was claiming to be; it just hijacked a user agent to deflect blame

It would be pretty weird, after all, for a company running a crawler to ignore robots.txt with hostile intent while also choosing to accurately ID itself to its victim.

shakna · 23h ago
Perplexity certainly was ignoring robots.txt [0]

Anthropic... their robots.txt handling requires a crawl delay to be defined, even though it's an optional extension. But whatever.

[0] https://www.wired.com/story/perplexity-is-a-bullshit-machine...

pyman · 22h ago
There's plenty of evidence to the contrary:

https://mjtsai.com/blog/2024/06/24/ai-companies-ignoring-rob...

_ache_ · 1d ago
Nice command line.
PeterStuer · 1d ago
For every one robots.txt that is genuinely configured, there are nine that make absolutely no sense at all.

Worse: GETting the robots.txt automatically flags you as a 'bot'!

So as a crawler that wants to respect the spirit of the robots.txt, not the inane letter that the cheapest junior webadmin for hire copy/pasted there from some Reddit comment, we now have to jump through hoops such as fetching the robots.txt from a separate VPN, etc.

Grimblewald · 21h ago
Well, robots.txt being an opaque, opt-out system was broken from the start. I've just started having hidden links and pages only mentioned in robots.txt, and any IP that tries those is immediately blocked for 24 hours. There is no reason to continue entertaining these companies.
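A rough sketch of that setup, assuming nginx's default combined access log and an nftables set created with `flags timeout` (plus a drop rule referencing it); the path and set names are made up:

    # robots.txt advertises a path no human and no well-behaved bot should ever fetch
    printf 'User-agent: *\nDisallow: /honeypot-9f3a/\n' >> /var/www/html/robots.txt

    # ban any IP that requests the trap path for 24 hours
    tail -F /var/log/nginx/access.log \
      | awk '$7 ~ /^\/honeypot-9f3a\// { print $1; fflush() }' \
      | while read -r ip; do
            nft add element inet filter banned "{ $ip timeout 24h }"
        done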
andrew_eu · 1d ago
I can imagine the large-scale web scrapers just avoid processing comments entirely, so while they may unzip the bomb, they could just discard the chunks that are inside a comment. The same trick could be applied to other elements in the HTML, though: semicolons in the style tag, some gigantic constant in inline JS, etc. If the HTML itself contained a gigantic tree of links to other zip bombs, that could also have an amplifying effect on the bad scraper.
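For instance, a variant where the filler lives in an inline `<style>` block rather than a comment could be generated the same way (untested sketch; sizes and file names are arbitrary):

    # same trick, but the bulk is junk CSS inside <style> instead of an HTML comment
    { echo '<html><head><style>'
      yes 'a{color:red}' | head -c 10G
      echo '</style></head><body>hello</body></html>'
    } | gzip > style-bomb.html.gz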
_ache_ · 1d ago
There are definitely improvements that can be made. The comment part is more about aesthetics; it isn't actually needed, you could have just put the zip chunk in a `div`, I guess.
chatmasta · 1d ago
Note: the submission link is not the zip bomb. It’s safe to click.
abirch · 1d ago
Sounds like something a person linking to a zip bomb would say :-D
slig · 1d ago
If you try to do that on a site with Cloudflare, what happens? Do they read the zip file and try to cache the uncompressed content to serve it with the best compression algorithm for a given client, or do they cache the compressed file and serve it "as is"?
bhaney · 1d ago
If you're doing this through cloudflare, you'll want to add the response header

    cache-control: no-transform
so you don't bomb cloudflare when they naturally try to decompress your document, parse it, and recompress it with whatever methods the client prefers.

That being said, you can bomb cloudflare without significant issue. It's probably a huge waste of resources for them, but they correctly handle it. I've never seen cloudflare give up before the end-client does.
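One way to check what actually reaches gzip-capable clients once the file is behind Cloudflare (URL is a placeholder):

    # you want to see content-encoding: gzip and cache-control: no-transform come back
    curl -sI -H 'Accept-Encoding: gzip' https://example.com/bomb.html \
      | grep -iE '^(content-encoding|cache-control|content-length)'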

uxjw · 23h ago
Cloudflare has free AI Labyrinths if your goal is to target AI. The bots follow hidden links to a maze of unrelated content, and Cloudflare uses this to identify bots. https://blog.cloudflare.com/ai-labyrinth/
cyanydeez · 21h ago
Do you think Meta AI's Llama 4 failed so badly because they ended up crawling a bunch of labyrinths?
Alifatisk · 10h ago
I dislike that the website's sidebar all of a sudden collapses during scrolling; it shifts all the content to the left in the middle of reading.
fdomingues · 10h ago
That content shift on page scroll is horrendous. Please don't do that; there is no need to auto-hide a sidebar.
Telemakhos · 1d ago
Safari 18.5 (macOS) throws an error WebKitErrorDomain: 300.
can16358p · 1d ago
Crashes Safari on iOS (not technically crashing the whole app, but the tab displays an internal WebKit error).
cooprh · 1d ago
Crashed 1password on safari haha
xd1936 · 1d ago
Risky click
ranger_danger · 1d ago
Did not crash Firefox nor Chrome for me on Linux.
AndrewThrowaway · 7h ago
Crashed Chrome tab on Windows instantly but Firefox is fine. It shows loading but pressing Ctrl + U even shows the very start of that fake HTML.
_ache_ · 1d ago
Perhaps you have very generous limits on RAM allocation per thread. I have 32 GB, 128 with swap, and it still crashes (silently on Firefox and with a dedicated error screen on Chrome).
throwaway127482 · 1d ago
Out of curiosity, how do you set these limits? I'm not the person you're replying to, but I'm just using the default limits that ship with Ubuntu 22.04
_ache_ · 1d ago
Usually in /etc/security/limits.conf. The field `as`, for address space, would be my guess, but I'm not sure; maybe `data`. The man page `man limits.conf` isn't very descriptive.
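For a quick experiment without editing limits.conf, the same cap can be set per shell with `ulimit` (value in KiB):

    # cap the address space (RLIMIT_AS) of this shell and its children at ~8 GiB;
    # anything launched from this shell, browser included, inherits the limit
    ulimit -v $((8 * 1024 * 1024))
    # the roughly equivalent persistent line in limits.conf would be:
    #   <user>  hard  as  8388608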
inetknght · 1d ago
> The man page `man limits.conf` isn't very descriptive.

Looks to me like it's quite descriptive. What information do you think is missing?

https://www.man7.org/linux/man-pages/man5/limits.conf.5.html

_ache_ · 16h ago
What is `data`? "Maximum data size (KB)". Is `address space limit (KB)` virtual or physical?

What is maximum file size in the context of a process?! I mean, what happens if a file is bigger? Maybe it can't write a file bigger than that, maybe it can't execute a file bigger than that.

I have a bunch of questions.

palmfacehn · 1d ago
Try creating one with deeply nested tags. Recursively adding more nodes via scripting is another memory waster. From there you might consider additional changes to the CSS that cause the document to repaint.
meinersbur · 1d ago
It will also compress worse, making it less like a zip bomb and more like a huge document. Nothing against that, but the article's trick is just to make a parser bail out early.
palmfacehn · 1d ago
For my usage, the compressed size difference with deeply nested divs was negligible.
esperent · 1d ago
It crashed the tab in Brave on Android for me.
johnisgood · 1d ago
It crashed the tab on Vivaldi (Linux).
Tepix · 1d ago
Imagine you're a crawler operator. Do you really have a problem with documents like this? I don't think so.
ChrisArchitect · 1d ago
Related:

Fun with gzip bombs and email clients

https://news.ycombinator.com/item?id=44651536