Image to Text Converter (image-1.org)

1 points by ghhjklhga 1m ago 0 comments

Ask HN: What are your honest thoughts on AI tools replacing programmers

1 points by kurrupttt 8m ago 0 comments

Unsafe Pricing at Any Scale (melkat.blog)

1 points by Bluestein 12m ago 0 comments

Show HN: PollMasters – A desktop app to create, send, and track WhatsApp polls (github.com)

1 points by smithscotts 18m ago 1 comments

LocalScore – Local AI Benchmark by Mozilla Builders (localscore.ai)

1 points by nalinidash 28m ago 0 comments

Superabundance and the Infinite Game (domofutu.substack.com)

2 points by wjb3 29m ago 1 comments

U.S. Pauses Exports of Airplane and Semiconductor Technology to China (nytimes.com)

1 points by bookofjoe 32m ago 2 comments

WebSockets guarantee order – so why are my messages scrambled? (sitongpeng.com)

2 points by todsacerdoti 35m ago 0 comments

Ask HN: Why reinvent front-end frameworks and static site builders?

1 points by keepamovin 37m ago 0 comments

Of course the Apple Network Server can be hacked into running Doom (oldvcr.blogspot.com)

5 points by classichasclass 42m ago 0 comments

Buy a Blog Post (scannedinavian.com)

2 points by akkartik 44m ago 0 comments

Map of the Known Human Metabolic Pathways (2017) (old.reddit.com)

1 points by downboots 46m ago 0 comments

Why does it matter if AI is sentient or not? (2084.substack.com)

1 points by thatbritishspy 49m ago 2 comments

Rent-Setting Algorithms Find Legal Lifeline (wsj.com)

1 points by JumpCrisscross 51m ago 0 comments

OpenAI featured chatbot is pushing extreme surgeries to "subhuman" men (citationneeded.news)

2 points by OuterVale 53m ago 0 comments

OpenAI models defy human commands, actively resist orders to shut down (computerworld.com)

2 points by _jcrossley 53m ago 0 comments

Ask HN: What are your thoughts on AI recruiters?

1 points by perthvandeda 1h ago 2 comments

Firefox now allows you to add custom search engine manually by default (bugzilla.mozilla.org)

1 points by gslin 1h ago 0 comments

Show HN: RustTensor: a Rust Library for Tensor Computation and ML Learning (github.com)

1 points by ramram6278 1h ago 0 comments

Enhancing MySQL: MySQL improvement project (github.com)

9 points by bratao 1h ago 5 comments

Google AI Edge Gallery (github.com)

1 points by xnx 1h ago 0 comments

Formal Modeling and Analysis of Distributed (Event-Driven) Systems (github.com)

2 points by ot 1h ago 1 comments

Waymo drives into a flooded road, results in the passenger getting stuck (twitter.com)

7 points by lopkeny12ko 1h ago 0 comments

Ironclad: Unix-like operating system kernel written in SPARK and Ada (codeberg.org)

5 points by thunderbong 1h ago 0 comments

Show HN: OBDium – Car Diagnostics Redefined (github.com)

1 points by provrb 1h ago 0 comments

Stepping Back (rjp.io)

3 points by rjpower9000 1h ago 1 comments

Progressive JSON (overreacted.io)

86 points by kacesensitive 1h ago 43 comments

Volcanic eruptions trigger ice formation in clouds (llnl.gov)

2 points by gmays 1h ago 0 comments

Giant microwave may change the future of war (technologyreview.com)

5 points by lanfeust6 1h ago 0 comments

First year's code doesn't matter (onboardedhq.substack.com)

3 points by plentysun 1h ago 1 comments

Spiritual Enclosure / Rubén Valdez (archdaily.com)

1 points by 9woc 1h ago 0 comments

Jemalloc (github.com)

2 points by amusingimpala75 1h ago 0 comments

AI helps researchers discover previously unknown molecules (thebrighterside.news)

1 points by geox 1h ago 0 comments

Apple's Reliance on China Is About Far More Than Labor Costs (bloomberg.com)

3 points by petethomas 2h ago 0 comments

White House says it will announce new pick for NASA chief (cnn.com)

6 points by ChrisMarshallNY 2h ago 0 comments

Show HN: I built a simple Google Maps lead generator over the weekend (lead-generator-one.vercel.app)

1 points by sonny177 2h ago 0 comments

Jamie Raskin Launches Investigation Into Trump's "Corrupt Pardon Spree" (politicususa.com)

11 points by Jacquie11 2h ago 2 comments

Autocratic Capitalism: An Introduction (daily.jstor.org)

7 points by mdp2021 2h ago 0 comments

Research: Multiple-answer question format in exams improves student attainment (phys.org)

2 points by PaulHoule 2h ago 0 comments

Show HN: Tracking Merged PRs by OpenAI's Codex and GitHub's Copilot (github.com)

3 points by zekone 2h ago 0 comments

Show HN: Patio – Rent tools, learn DIY, reduce waste (patio.so)

47 points by GouacheApp 2h ago 27 comments

Green Tea Garbage Collector (github.com)

3 points by kristianp 2h ago 1 comments

Reduced OpenAI RAG costs by 70% by using a pre-check API call

1 points by Kong91 2h ago 2 comments

KSL Investigates: How to Avoid Inheriting a Timeshare You Don't Want (2021) (ksltv.com)

3 points by josephcsible 3h ago 3 comments

Energy Dept. Unveils Supercomputer That Merges with A.I (nytimes.com)

1 points by bookofjoe 3h ago 1 comments

Show HN: The Daily Quandary (whythink.org)

1 points by rjhackin 3h ago 0 comments

Ask HN: Magic wand to fix one thing about cloud software development?

1 points by uptownhr 3h ago 0 comments

Where does debt go after death? (ramseysolutions.com)

6 points by downboots 3h ago 0 comments

The 55% Regret Club: How AI-First Companies Are Learning Lessons the Hard Way (groktop.us)

14 points by tickbyte 3h ago 7 comments

Writing an LLM from scratch, part 15 – from context vectors to logits (gilesthomas.com)

3 points by gpjt 3h ago 0 comments

Show HN: Local LLM AIME benchmarking tool

1 belluxx 0 5/31/2025, 12:11:21 PM github.com ↗

I made this simple tool to compare local LLMs. Any provider that supports OpenAI-like APIs can be used (LMStudio, Llama.cpp, Ollama) but you can also use Openrouter/OpenAI if you change the base URL accordingly.

In my opinion it is not particularly useful for comparing different models from different companies since some models are optimized heavily on math or even trained on AIME problems.

However it is really useful for testing different quantizations of the same model or the same quantization from different providers.

Let me know what you think about it!

Also check the README to see some examples of the results you will get from it.

Comments (0)

No comments yet