Technical Interviews are realigning with reality through AI (cendyne.dev)

1 points by furkansahin 22s ago 0 comments

GPT5 Is Horrible (old.reddit.com)

1 points by druskacik 1m ago 0 comments

Robots.txt for the AI Era but Enforceable (aiprivacylicense.com)

1 points by nabanita 6m ago 1 comments

Air Force buying two Tesla Cybertrucks so it can learn to destroy them (theregister.com)

2 points by ndsipa_pomu 11m ago 1 comments

Agentic Workflow: What's inside RAGFlow v0.20.0 (medium.com)

1 points by vissidarte_choi 12m ago 0 comments

defer-import-eval: proposal for introducing a way to defer evaluate of a module (github.com)

1 points by tilt 14m ago 0 comments

OpenAI is taking GPT-4o away from me – despite promising they wouldn't (community.openai.com)

1 points by Mzxr 18m ago 0 comments

Show HN: Streamed JSON Lines (A big JSON can't be streamed) (medium.com)

1 points by marius-ciclistu 22m ago 0 comments

Bsky Tracker: v2.5.0 Is Live (bsky.app)

1 points by pavlostze 23m ago 1 comments

Malicious Ruby Gems Used in Targeted Credential Theft Campaign (socket.dev)

1 points by amalinovic 36m ago 0 comments

HBO Max is going to get more annoying about password sharing (theverge.com)

2 points by tosh 37m ago 0 comments

Ask HN: GPT-5 still needs a second nudge for calculation?

1 points by chandlertsien 39m ago 0 comments

EU Artificial Intelligence Act (artificialintelligenceact.eu)

1 points by jonbaer 41m ago 0 comments

White Mountain Direttissima (whitemountainski.co)

1 points by oftenwrong 41m ago 0 comments

Linux Desktop Share Tops 6% in 15M-System Analysis (zdnet.com)

2 points by naves 46m ago 0 comments

OpenAI CEO Sam Altman says GPT-5 scares him – 'what have we done?' (tomsguide.com)

2 points by pera 48m ago 1 comments

Tokenization in Large Language Models (seantrott.substack.com)

2 points by tokfan 51m ago 0 comments

Comuniq – A lightweight space for publishing and discussing specific topics

1 points by 01-_- 53m ago 0 comments

Nature study on economic damages from climate change revised (pik-potsdam.de)

1 points by 01-_- 54m ago 0 comments

ChatGPT5 can't answer "How many states have R in it's name?" (bsky.app)

4 points by mattigames 56m ago 1 comments

How to Run Your Own OpenAI GPT OSS Server for Fun and Profit (northcodie.blogspot.com)

4 points by nickly 57m ago 2 comments

Understanding Late Binding in Python Closures (pythonkoans.substack.com)

2 points by meander_water 58m ago 0 comments

Loyalty programmes are keeping America's airlines aloft (economist.com)

2 points by jmsflknr 59m ago 0 comments

US Adds Surprise Gold Bar Tariff in Blow to Switzerland (bloomberg.com)

5 points by petethomas 1h ago 0 comments

Can't disable copilot code reviews (github.com)

2 points by TonyTrapp 1h ago 0 comments

Ask HN: What is "response-level error rate" and how is it measured?

2 points by myyke 1h ago 0 comments

VS Code Tools That Supercharge Your Workflow (jsdevspace.substack.com)

3 points by javatuts 1h ago 1 comments

Gigabit Ethernet over Telephone Lines (tripletime.de)

1 points by lis 1h ago 0 comments

Nonlinear Attention Decoded – Verified by 3 AI Systems (zenodo.org)

1 points by GhostDrift 1h ago 1 comments

XGBoost 3.0: Training Terabyte-Scale Datasets with Grace Hopper Superchip (thedailypeac.blogspot.com)

3 points by miclys 1h ago 0 comments

Zeppe-Lin 1.1 – A Minimal Source-Based Linux Distro (Crux Fork) (zeppe-lin.github.io)

1 points by sighook 1h ago 1 comments

Leo (savewithleo.com)

1 points by BryanHoulton 1h ago 0 comments

Writing Surtoget.no with Gleam (lindbakk.com)

1 points by todsacerdoti 1h ago 0 comments

Show HN: PeekaTube – Preview YouTube summaries without opening the video (chromewebstore.google.com)

1 points by project_stain 1h ago 0 comments

Announcing Rust 1.89.0 (blog.rust-lang.org)

4 points by pjmlp 1h ago 0 comments

Leverage Points: Places to Intervene in a System (donellameadows.org)

2 points by tosh 1h ago 0 comments

List of Tesla senior executives departures in the past 6 months (twitter.com)

3 points by TheAlchemist 1h ago 0 comments

The Philosopher-Maker (philosophermaker.substack.com)

1 points by niho 1h ago 0 comments

Immortal Mags – Independent Magazines Repository (immortal-mags.xyz)

1 points by eligg 1h ago 0 comments

Does AI Boost Developer Productivity? – Yegor Denisov-Blanch, Stanford [video] (youtube.com)

3 points by swyx 1h ago 0 comments

Publish or Perish: The Board Game of Academic Survival (the-scientist.com)

3 points by erehweb 1h ago 0 comments

The Rise of Silicon Valley's Techno-Religion (nytimes.com)

3 points by mykowebhn 1h ago 1 comments

Default tab size changed from eight to four on GitHub (github.blog)

2 points by petercooper 1h ago 0 comments

OpenAI gets caught vibe graphing (theverge.com)

4 points by pera 1h ago 0 comments

The web isn't URL-shaped anymore (jonoalderson.com)

1 points by ingve 1h ago 0 comments

Show HN: Duck Duck Clicker – Free idle clicker: build a duck empire, golden duck (mergebrainrot.com)

1 points by liualexander112 2h ago 0 comments

GPT-5 New Params and Tools (cookbook.openai.com)

3 points by tosh 2h ago 0 comments

On The Value of Abstractions (cekrem.github.io)

1 points by thunderbong 2h ago 0 comments

Show HN: Brainrot Craft – Merge Italian Memes (No Download) (brainrotcraft.top)

1 points by liualexander112 2h ago 0 comments

Qwen-Image: AI Image Generation with Native Text Rendering (qwenimagen.com)

2 points by laiwuchiyuan 2h ago 0 comments

Turn Any Website into an API

8 points by pcl · 3 comments · 8/8/2025, 5:10:46 AM · parse.bot

Comments (3)

vin047 · 30m ago

No information on pricing on the site.

runningmike · 2h ago

Nice idea. In practice, many sites use different methods to prevent scraping. There's a big risk of ending up doing things manually, imho.

renegat0x0 · 10m ago

Huh, I have been working on a solution to that problem.

My project lets you define rules for various sites, so eventually everything is scraped correctly. For YouTube, yt-dlp is also used to augment the results.

I can crawl using requests, Selenium, HTTPX, and others. The response is JSON, so it is easy to process.

The downside is that it may not be the fastest solution, and I have not tested it against proxies.

https://github.com/rumca-js/crawler-buddy
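
To illustrate the general idea of per-site rules with a JSON response, here is a minimal sketch. It is not crawler-buddy's actual code or API: `SiteRule`, `RULES`, `pick_rule`, and `scrape` are hypothetical names, the regex patterns are made up, and the backend field is only a label (nothing is fetched over the network; the HTML is passed in directly).

```python
import json
import re
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class SiteRule:
    """Hypothetical per-site rule: preferred backend and a title regex."""
    domain: str         # domain the rule applies to ("*" means fallback)
    backend: str        # label only in this sketch, e.g. "requests"
    title_pattern: str  # regex with exactly one capture group


# Assumed example rules; real rules would be tuned per site.
RULES = [
    SiteRule("youtube.com", "yt-dlp", r'<meta name="title" content="([^"]*)"'),
    SiteRule("example.com", "requests", r"<h1>(.*?)</h1>"),
]
DEFAULT_RULE = SiteRule("*", "requests", r"<title>(.*?)</title>")


def pick_rule(url: str) -> SiteRule:
    """Return the first rule whose domain matches the URL's host."""
    host = urlparse(url).hostname or ""
    for rule in RULES:
        if host == rule.domain or host.endswith("." + rule.domain):
            return rule
    return DEFAULT_RULE


def scrape(url: str, html: str) -> str:
    """Apply the matching rule to already-fetched HTML and return JSON."""
    rule = pick_rule(url)
    match = re.search(rule.title_pattern, html, re.IGNORECASE | re.DOTALL)
    result = {
        "url": url,
        "backend": rule.backend,
        "title": match.group(1).strip() if match else None,
    }
    return json.dumps(result)


if __name__ == "__main__":
    print(scrape("https://www.example.com/page", "<html><h1> Hello </h1></html>"))
```

Keeping the rules as plain data means a new site can be supported by adding one entry rather than changing the scraping logic, and unknown sites still fall back to a generic rule.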