Show HN: Exosphere – Platform for async/batch AI agents
We built Exosphere (exosphere.host) – a platform to orchestrate and run batch AI agents over large datasets, with connectors, autoscaling, and affordable inference (up to 75% cheaper). Think of it as a control plane for async AI workloads.
Why we built it: Running background AI workflows (like summarizing 1M support chats or processing 10K PDFs) is messy – you need queueing, scaling, model hosting, cost control, and integration with your systems. Most infra today is optimized for chat apps, not bulk tasks or pipelines. This will only get messier as multi-step AI agents and workflows arrive.
What Exosphere does:
- Supports batch AI agents with parallelism, retries, and memory
- Integrates with tools like S3, GCS, Notion, Pinecone, etc.
- Works with open-source models like DeepSeek and LLaMA, as well as Claude via API
- Has a soon-to-be open-source orchestrator called Orbit (built from scratch)
- Cost-optimized infra tuned for large-scale and delayed inference
Easy onboarding, no GPU setup required
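To make "parallelism with retries" concrete, here is a minimal, generic sketch of the pattern a batch orchestrator automates: fan out work items in parallel, retry transient failures, and collect results. The `call_model` function is a hypothetical stand-in for a model inference call – this is not Exosphere's actual API.

```python
# Generic batch pattern: parallel fan-out + per-item retries.
# call_model is a hypothetical stand-in, NOT Exosphere's real API.
from concurrent.futures import ThreadPoolExecutor

MAX_RETRIES = 3

def call_model(doc: str) -> str:
    # Stand-in for an inference call; real code would hit a model API.
    if not doc:
        raise ValueError("empty document")
    return doc.upper()  # pretend this is a "summary"

def with_retries(doc: str) -> str:
    # Retry transient failures, re-raising after the final attempt.
    for attempt in range(1, MAX_RETRIES + 1):
        try:
            return call_model(doc)
        except Exception:
            if attempt == MAX_RETRIES:
                raise

def run_batch(docs: list[str]) -> list[str]:
    # Fan out across a worker pool; order of results matches input order.
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(with_retries, docs))
```

At scale, an orchestrator adds what this sketch leaves out: durable queues, backoff, autoscaling workers, and checkpointing partial progress.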
Example use cases:
- Classify or extract info from 100K PDFs
- Run retrieval-based QA across millions of records
- Summarize and route large volumes of tickets or feedback
- Batch-label images or text for fine-tuning
We’d love feedback from this community – thoughts on dev experience, connectors to add, model support, or features you'd want in the agent platform.
You can try it out at exosphere.host, or reply here if you want some free credits for trying open-source models in batch.
Thanks! – Nivedit (ex-Azure OpenAI) and the team