Automated Test Failures in CICD – what is true cost?

I'm hitting a wall with our CI/CD pipeline, and I'm curious if this is just me or a universal struggle. We've got a pretty standard setup—a bunch of unit, integration, and end-to-end tests running on every commit. Lately, it feels like I'm spending more time debugging failing builds than writing actual code.

Just yesterday, I spent three hours trying to reproduce an end-to-end test failure that occurred only on the main branch. The test passed locally and on the re-run, but the initial failure remained a complete mystery. It's a massive productivity sink.

So I'm genuinely curious:

How much time do you realistically spend each week just debugging failed CI/CD builds? (And be honest—I'm not going to tell your boss.)

What's the absolute worst part of the process for you? Is it the context-switching, sifting through hundreds of lines of logs, or dealing with tests that pass locally but fail in CI?

What kind of CI/CD failure is the most frustrating? Flaky tests? Environment-specific issues? That one random timeout?

If you could wave a magic wand and solve one thing about build failures, what would it be?

I'm hoping to hear some stories and maybe even learn a few tricks from you all. Thanks in advance for sharing!

Comments (1)

mmarian · 5h ago

My personal frustration is with service tests involving testcontainers: https://developerwithacat.com/blog/032025/test-containers-ba... TL;DR: they take too much effort to set up and don't reflect the real environment.


