Debezium to olake.io – PhysicsWallah's switch for CDC

pkhodiyar · 4/30/2025, 12:44:45 PM
We recently hosted a small online meetup at OLake where a Data Engineer at PhysicsWallah walked through why his team dropped Debezium and moved to OLake’s “MongoDB → Iceberg” pipeline.

Video (29 min): https://www.youtube.com/watch?v=qqtE_BrjVkM

If you’re someone who prefers text, here’s the quick TL;DR:

Why Debezium became a drag for them:
1. Long full loads on multi-million-row MongoDB collections, and any failure meant restarting from scratch
2. Kafka and Connect infrastructure felt heavy when the end goal was “Parquet/Iceberg on S3”
3. Handling heterogeneous arrays required custom SMTs (sketched below)
4. Continuous streaming only; they still had to glue together ad-hoc batch pulls for some workflows
5. Ongoing schema drift demanded extra code to keep Iceberg tables aligned
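
For context on item 3: with Debezium, even a “simple” MongoDB → files path means running Kafka plus Kafka Connect and maintaining connector configs like the one below. This is a generic illustration, not their actual setup; the hostnames, connector/collection names are made up, and the exact property names vary by Debezium version.

```python
# Illustrative only: register a Debezium MongoDB connector with an unwrap SMT
# via the Kafka Connect REST API. Names and hosts are hypothetical; property
# names differ across Debezium versions.
import json
import requests

connector = {
    "name": "mongo-orders-cdc",  # hypothetical connector name
    "config": {
        "connector.class": "io.debezium.connector.mongodb.MongoDbConnector",
        "mongodb.connection.string": "mongodb://mongo:27017/?replicaSet=rs0",
        "topic.prefix": "pw",                      # prefix for change-event topics
        "collection.include.list": "shop.orders",  # hypothetical collection
        # Flatten the change-event envelope into a plain document and pin a
        # single array encoding so downstream sinks don't choke on
        # heterogeneous arrays.
        "transforms": "unwrap",
        "transforms.unwrap.type": "io.debezium.connector.mongodb.transforms.ExtractNewDocumentState",
        "transforms.unwrap.array.encoding": "document",
    },
}

resp = requests.post(
    "http://connect:8083/connectors",  # Kafka Connect REST endpoint
    headers={"Content-Type": "application/json"},
    data=json.dumps(connector),
)
resp.raise_for_status()
```

And that is before any custom SMT code for the array shapes the built-in options can’t handle.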

What changed with OLake?
-> Writes directly from MongoDB (and friends) into Apache Iceberg, no message broker in between

-> Two modes: full load for the initial dump, then CDC for ongoing changes, exposed by a single flag in the job config

-> Automatic schema evolution: new MongoDB fields appear as nullable columns; complex sub-docs land as JSON strings you can parse later (a Spark sketch further down shows one way)

-> Resumable, chunked full loads: a pod crash resumes instead of restarting

-> Runs as either a Kubernetes CronJob or an Airflow task; config is one YAML/JSON file.
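
Since people usually ask what the “Airflow task” option looks like in practice, here is a minimal sketch of one way to wire it up with KubernetesPodOperator. Everything OLake-specific in it (image name, CLI arguments, config path) is an assumption for illustration, not OLake’s documented interface; check the repo for the real invocation.

```python
# Minimal sketch: run an OLake sync as an Airflow task, assuming the sync is
# packaged as a container image. Image name, arguments, and config path are
# assumptions for illustration only.
from datetime import datetime

from airflow import DAG
from airflow.providers.cncf.kubernetes.operators.pod import KubernetesPodOperator

with DAG(
    dag_id="mongo_to_iceberg_sync",   # hypothetical DAG name
    start_date=datetime(2025, 1, 1),
    schedule="@hourly",               # incremental CDC runs every hour
    catchup=False,
) as dag:
    sync = KubernetesPodOperator(
        task_id="olake_sync",
        name="olake-sync",
        namespace="data-pipelines",              # hypothetical namespace
        image="olakego/source-mongodb:latest",   # assumed image name
        arguments=[
            "sync",
            "--config", "/config/job.json",      # single job config file (assumed path)
        ],
        get_logs=True,
    )
```

The Kubernetes CronJob option is the same idea without Airflow: one container, one config file, on a schedule.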

Their stack in one line: MongoDB → OLake writer → Iceberg on S3 → Spark jobs → Trino / occasional Redshift, all orchestrated by Airflow and/or K8s.
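
To make the schema-evolution bullet concrete: once sub-documents land as JSON-string columns in Iceberg, a downstream Spark job can pull fields out with ordinary JSON functions. A minimal PySpark sketch, with made-up catalog, table, and column names:

```python
# Sketch of the "parse the JSON strings later" step: read an Iceberg table
# written by the CDC pipeline and extract fields from a sub-doc column that
# landed as a JSON string. All names here are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, get_json_object

spark = (
    SparkSession.builder
    .appName("parse-subdocs")
    # Assumes an Iceberg catalog named "lake" is already configured
    # (spark.sql.catalog.lake = org.apache.iceberg.spark.SparkCatalog, etc.).
    .getOrCreate()
)

orders = spark.table("lake.shop.orders")  # hypothetical Iceberg table

parsed = orders.select(
    col("_id"),
    # "shipping" is a nested MongoDB sub-document stored as a JSON string.
    get_json_object(col("shipping"), "$.city").alias("shipping_city"),
    get_json_object(col("shipping"), "$.postal_code").alias("shipping_postal_code"),
)

parsed.show(truncate=False)
```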

Posting here because many of us still bolt Kafka onto CDC just to land files. If you only need Iceberg tables, a simpler path might exist now. Curious to hear others’ experiences with broker-less CDC tools.

(Disclaimer: I work on OLake and hosted the meetup, but the talk is purely technical.)

Check out the GitHub repo: https://github.com/datazip-inc/olake
