Iceberg, the Right Idea – The Wrong Spec

Comments (1)

DannyPage · 11h ago

We just finished implementing Iceberg on top of a large set of Parquet files stored in S3. It’s a neat idea to be able to turn a lot of data files into a SQL database, but I absolutely understand the pain and confusion the author writes about, especially around how it handles metadata. It creates a lot of metadata files and makes a large mess of the directory. Some queries that I know would return a single Parquet file take up to 30 seconds.

I don’t think we’ll scrap it, and there are certainly ways to speed up the problematic aspects of querying the catalog, but I’m also rooting for DuckLake to make this a lot more approachable by not completely shying away from the database as an idea.
