Show HN: Fast360 – A web tool to benchmark open-source OCR models side-by-side

yanaimngvov · 8/18/2025, 3:19:44 AM · fast360.xyz
Hey HN,

Like many of you, I've been building RAG pipelines recently, and constantly hit a wall at the very first step: getting clean, structured Markdown from PDFs.

I found myself in a loop of "environment hell": spinning up different Conda environments to test Marker, then PP-StructureV3, then MinerU, just to see which one worked best for a specific paper or financial report. It was a massive time sink. Static leaderboards weren't much help either, because they can't tell you how a model will perform on your specific, messy document.

So, I built the tool I wished I had. It's a simple web utility that I call an "OCR Arena."

You can try it here: https://fast360.xyz

The idea is simple: upload a document, select from a lineup of 7 leading open-source models, and it runs them all in parallel, showing you the results side-by-side. The goal is to get you from "which parser should I use?" to having the best possible Markdown in under a minute.
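The fan-out step can be sketched in a few lines. This is a minimal illustration, not the service's actual code: the parser registry and its lambda stand-ins are hypothetical placeholders for real adapters around Marker, MinerU, PP-StructureV3, and the rest.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical parser registry: each entry maps a model name to a callable
# that takes a document path and returns Markdown. Real adapters would
# wrap the actual OCR models.
PARSERS = {
    "marker": lambda path: f"# markdown from marker for {path}",
    "mineru": lambda path: f"# markdown from mineru for {path}",
    "pp-structurev3": lambda path: f"# markdown from pp-structurev3 for {path}",
}

def run_all(path, parsers=PARSERS):
    """Run every parser on the same document in parallel and collect the
    outputs keyed by model name, ready to render side by side."""
    with ThreadPoolExecutor(max_workers=len(parsers)) as pool:
        futures = {name: pool.submit(fn, path) for name, fn in parsers.items()}
        return {name: fut.result() for name, fut in futures.items()}
```

The key point is that every model sees the exact same input, so the comparison is apples-to-apples.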

It's completely free, and I made sure there's no login/signup required so you can try it with zero friction. Here’s a quick GIF of the workflow:

https://github.com/shijincai/fast360/blob/main/nologin.gif

The tech stack is a pretty standard setup: Next.js/React on the frontend, a Node.js/Express backend acting as a BFF, and a Python service that manages the model execution via a Redis/BullMQ queue.
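The queue layer decouples the web request from the (slow) model run. Here is a toy producer/worker sketch of that pattern using Python's stdlib queue as a stand-in for Redis/BullMQ; the job schema is my assumption, not what fast360 actually uses.

```python
import queue
import threading

jobs = queue.Queue()   # stand-in for the Redis/BullMQ job queue
results = {}

def worker():
    """Python-side worker: pull OCR jobs off the queue and run them."""
    while True:
        job = jobs.get()
        if job is None:  # sentinel tells the worker to shut down
            break
        # The real service would dispatch to the selected model here.
        results[job["id"]] = f"markdown for {job['file']} via {job['model']}"
        jobs.task_done()

t = threading.Thread(target=worker)
t.start()
jobs.put({"id": "job-1", "file": "report.pdf", "model": "marker"})
jobs.put(None)
t.join()
```

In the real stack, the Node/Express BFF plays the producer role and the Python service plays the worker, with Redis persisting jobs between them.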

This is a web service, not an open-source project, but I've set up a public GitHub repo to act as an information hub: a place to track community feedback and to share more about the tech. You can find that here:

GitHub: https://github.com/shijincai/fast360

I built this to solve my own problem, but I'm hoping it might be useful to some of you as well. I'll be here all day to answer any questions and listen to your thoughts.

Comments (1)

yanaimngvov · 5h ago
One of the most fascinating (and challenging) parts of building this was seeing just how wildly different the "best" model can be depending on the document type.

For example, during testing, I found that Marker is an absolute champion for clean, single-column layouts like blog posts. But throw a dense, multi-column academic paper at it, and MinerU often produces a far superior, structured output with proper LaTeX. Then, for a complex invoice table, PP-StructureV3 frequently beats both of them.

This really solidified my belief that a "one-size-fits-all" parser is a myth. The future seems to be less about finding a single perfect model and more about building a quick, effective workflow for selecting the right specialist for the job. It's a classic "routing" problem, and this tool is my attempt at solving the first step of that puzzle.
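That first routing step could start as something as simple as a lookup from detected layout type to specialist model. The table below just encodes the observations from my testing and is illustrative, not something the service ships; a real router would first need to classify the document's layout.

```python
# Illustrative routing table based on the observations above.
ROUTES = {
    "single_column": "marker",          # clean, blog-post style layouts
    "multi_column_academic": "mineru",  # dense papers with LaTeX math
    "table_heavy": "pp-structurev3",    # invoices, financial tables
}

def pick_model(layout_type, default="marker"):
    """Return the specialist model for a detected layout type,
    falling back to a sensible default for unknown layouts."""
    return ROUTES.get(layout_type, default)
```

The hard part, of course, is the layout classifier in front of this lookup, which is exactly why a quick side-by-side comparison tool is useful in the meantime.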