To Hell with Good Intentions (1968) [pdf] (uvm.edu)

We're building infer.bid, a real-time matchmaking platform that connects suppliers and consumers of AI inference compute. Our primary focus is on open-source models initially, aiming to tackle high inference costs and GPU scarcity by introducing dynamic pricing through a real-time bidding system.

We’d love your input:

If you use services like OpenRouter, Replicate, or Hugging Face, what are your biggest pain points? Is cost your primary issue, or are you looking for more advanced features such as intelligent model routing, caching, or prompt optimization to save costs?

Would a platform that lets you dynamically bid for inference compute be compelling for your use cases? How important is pricing transparency and the ability to choose your hardware provider directly?

We're trying to better understand different user scenarios and prioritize features that directly address the community's needs. Any insights or feedback would be greatly appreciated!

Thank you!

cratermoon · 18h ago

We've got xena's Skypilot already, how would you differentiate your offering? https://xeiaso.net/talks/2025/ai-chatbot-friends/

cryptolibertus · 7h ago

This is still down the path of running your own infra which is a choice the inference consumer needs to make regarding the setup they have and if the management is worth it. We are also looking into how to encrypt or privatize prompts with inference providers to maintain privacy of your data. Infer.bid is more about having an open marketplace of inference providers to compete on price for your business. In the future your app/biz organization might just need inference and not have the hassle of maintaining the infra for it. Instead you could just consume inference like electricity.

To Hell with Good Intentions (1968) [pdf] (uvm.edu)

Lab Rats Sperm Race [video] (youtube.com)

Resisting the Crawl (cousinthrockmorton.github.io)

Elizabeth Holmes's Partner Has a New Blood-Testing Startup (nytimes.com)

This Los Angeles port is among the first casualties of Trump's trade war (washingtonpost.com)

Improved fingerprint search algo helps crack a 48-Year-old murder case (nytimes.com)

Satellite will have to be turned off when it floats over the US (thecooldown.com)

Why is appending to the innerHTML property bad? (stackoverflow.com)

Karakeep: The Bookmark Everything App (karakeep.app)

Zircon – Kernel for Google's Fuchsia (fuchsia.dev)

What's the Cost to Society of Pollution? Trump Says Zero (nytimes.com)

Roast startup pitches for 14 mins (any feedback?) (youtube.com)

To build affordable homes on a mass scale, Levitt sought cost-cutting measures (wsj.com)

FocusedValues in SwiftUI (shadowfacts.net)

I made porn addiction quitting app Unlust. Made by someone who's been there (unlustapp.com)

A quarter decade of learnings from scaling RAG to users (hello-jp.net)

The surgeon who used F1 pitstop techniques to save lives of babies (thetimes.com)

The Connoisseur of Desire (nybooks.com)

Key open source challenges in developing countries (2023) (opensource.com)

Why are coffee stains darker at the edges? (why.is)

Show HN: PondPilot – Run SQL on DuckDB/CSV/etc. Locally via WASM (Open-Source) (app.pondpilot.io)

Valuation of Ride Hailer, Ola Cut from $7.3B to $1.25B (entrackr.com)

All-in-one AI marketing agent and copilot desktop app to cross-post to any sites (github.com)

Harvard offers free tuition for families making $200K or less: My thoughts (greyenlightenment.com)

Senate Hearing on Artificial Intelligence [video] (youtube.com)

The overlooked masterpiece full of coded messages about World War One (bbc.com)

Visit the Arctic vault holding back-ups of great works (bbc.com)

A GitHub MCP Server Extension for Zed (github.com)

Check out this cool tool to find creators who nail product promos (old.reddit.com)

Show HN: Color Name Game (andrewmatte.itch.io)

Could the English Language Die? (theguardian.com)

Why Bell Labs Worked (1517.substack.com)

Moral Outrage Predicts the Virality on Social Media, but Not the Support (journals.sagepub.com)

Show HN: Sqlitemap, a persistent map implementation backed by SQLite for C++ (github.com)

Hyperion: The Tallest Tree in the World (ourplnt.com)

Ask HN: What do you think of this color changing Rubik's Cube variant?

The Facts in the Case of the Great Beef Contract (1867) (americanliterature.com)

VIC-20 (en.wikipedia.org)

Beating the fastest lexer generator in Rust (alic.dev)

Genetic mutation lets some people thrive on just 4 hours of sleep (livescience.com)

Clone.fyi (clone.fyi)

How to Use the Beef Framework over WAN Like a Hacking Pro[2024] (thekitchentoday.com)

How climate change is altering bird migration (dw.com)

Dramatically improve microscope resolution with LED array and Ptychography [video] (youtube.com)

Large Language Models Are Autonomous Cyber Defenders (arxiv.org)

A Django rest API key package (github.com)

Why Laptop Batteries Can't Get Any Bigger (For Now) (ifixit.com)

Taliban suspends chess over gambling concerns (bbc.com)

Cutenews 2.0 (github.com)

Burrito Now, Pay Later (enterprisevalue.substack.com)

Ask HN: Would You Use a Real-Time Auction Marketplace for AI Inference?

Comments (3)