I've built an open source streaming library for async pipelines

ju-bezdek · 6/2/2025, 10:07:56 AM · github.com

Comments (1)

ju-bezdek · 1d ago
I’ve been working with LLMs a lot lately, and one consistent UX bottleneck is inference speed.

Many tasks follow this pattern: process small chunks → batch for inference → split results again. Parallelizing helps, but naive asyncio.gather approaches often backfire: every downstream stage waits on the slowest item in the batch, killing responsiveness. Mixing fast per-item logic with slower batch steps needs smarter coordination.
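To make the bottleneck concrete, here is a minimal sketch of the naive pattern (the function names and steps are hypothetical, just stand-ins for per-item preprocessing and a slow batched model call):

```python
import asyncio

# Hypothetical stages, for illustration only.
async def preprocess(item: int) -> int:
    return item * 2

async def infer_batch(batch: list[int]) -> list[int]:
    # Simulate a slow batched inference call.
    await asyncio.sleep(0.01 * len(batch))
    return [x + 1 for x in batch]

async def naive(items: list[int]) -> list[int]:
    # Stage 1: every item is preprocessed before anything moves on.
    pre = await asyncio.gather(*(preprocess(i) for i in items))
    # Stage 2: one big batch; the caller sees nothing until the
    # slowest element of the entire batch has finished.
    return await infer_batch(list(pre))

print(asyncio.run(naive([1, 2, 3])))
```

The gather-per-stage shape means latency is governed by the slowest element at each barrier, and memory grows with the whole dataset rather than with a batch.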

Technical approach: I built a pipeline library that handles the streaming coordination automatically. It uses async generators throughout, with intelligent queuing for order preservation when needed.
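To illustrate the async-generator approach (this is a generic sketch of the technique, not the library's actual API): stages are chained generators, so a batch's results are yielded downstream as soon as that batch completes, instead of after the whole dataset.

```python
import asyncio
from typing import AsyncIterator

async def source(n: int) -> AsyncIterator[int]:
    for i in range(n):
        yield i

async def batcher(items: AsyncIterator[int], size: int) -> AsyncIterator[list[int]]:
    # Group a per-item stream into fixed-size batches for inference.
    batch: list[int] = []
    async for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # flush the final partial batch

async def infer(batches: AsyncIterator[list[int]]) -> AsyncIterator[int]:
    # Run the slow batch step, then split results back into items,
    # yielding each one as soon as its batch is done.
    async for batch in batches:
        await asyncio.sleep(0.01)  # stand-in for model latency
        for result in (x * 10 for x in batch):
            yield result

async def main() -> list[int]:
    out = []
    async for r in infer(batcher(source(5), size=2)):
        out.append(r)  # results arrive batch-by-batch, not all at the end
    return out

print(asyncio.run(main()))
```

Because generators pull items lazily, only one batch is in flight per stage here; order preservation comes for free in this single-worker sketch, and needs explicit queuing only once batches run concurrently.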

Architecture decisions:

- Stream-first design: results flow downstream by default, with optional collection
- Flexible ordering: choose between speed (unordered) and sequence (ordered)
- Memory efficiency: O(batch_size) memory usage, not O(dataset_size)
- Backpressure handling: automatic coordination between fast and slow stages
- Error boundaries: configurable failure strategies at the task level
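The backpressure and memory points above can be sketched with a bounded asyncio.Queue between a fast and a slow stage (again a generic illustration, not the library's internals; the stage functions are hypothetical):

```python
import asyncio

async def producer(queue: asyncio.Queue, n: int) -> None:
    for i in range(n):
        # put() suspends once the queue is full, so a fast stage can
        # never run more than `maxsize` items ahead of a slow one.
        await queue.put(i)
    await queue.put(None)  # sentinel: end of stream

async def slow_consumer(queue: asyncio.Queue) -> list[int]:
    out = []
    while (item := await queue.get()) is not None:
        await asyncio.sleep(0.005)  # stand-in for slow batch inference
        out.append(item)
    return out

async def main() -> list[int]:
    # Buffering is bounded by maxsize, so memory stays O(batch_size)
    # no matter how large the input stream is.
    queue: asyncio.Queue = asyncio.Queue(maxsize=4)
    _, results = await asyncio.gather(producer(queue, 10), slow_consumer(queue))
    return results

print(asyncio.run(main()))
```

The bounded queue is what turns "fast producer, slow consumer" from an unbounded buffer into automatic flow control: the producer simply parks on put() until the consumer catches up.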