Ask HN: How do you measure "AI slop"?

crakhamster01 · 7/31/2025, 8:48:29 AM
Recently, my employer has been pushing hard for LLM adoption across eng, with an expectation of increased productivity. Eng has followed suit, and as a result I've been getting a lot more PRs that are clearly AI generated: 100-line diffs that could have been 10, missed error cases, broken conventions. It's not just from junior engineers, but often from other senior engineers now.

With our incentive structures, it doesn't seem like there's a great way to prevent this decline in quality. It's been hard for me to quantify _why_ "slop" is bad, but my gut feelings are that:

  1. The codebase becomes unreadable to human engineers.

  2. Having more bad examples in the codebase creates a negative feedback loop for future LLM changes. And maybe this is the new norm, but ->

  3. Once enough slop gets in, future incidents/SEVs become increasingly difficult to resolve.

(3) feels like the only reason with tangible business impact, and even if it did happen, I don't know if it would be possible to tie the slower incident response or lost revenue back to AI slop.

I’ve seen other posts lamenting the ills of vibe coding, but is there a concrete way to justify investing in code quality in the era of LLMs? My thought is that it might be useful to track a code quality metric like cyclomatic complexity and see if it correlates with regressions over time, but that feels kind of thin (and retrospective).
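A minimal sketch of what that tracking might look like, assuming Python sources and using only the standard library. The decision-point count below is a rough stand-in for real cyclomatic complexity, and the idea of logging it per commit next to regression data is an assumption for illustration, not something we actually run:

    import ast
    import sys

    def approx_cyclomatic_complexity(source: str) -> dict[str, int]:
        """Very rough per-function complexity: 1 + number of decision points."""
        tree = ast.parse(source)
        scores = {}
        for node in ast.walk(tree):
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
                complexity = 1
                for child in ast.walk(node):
                    # Count branches: if/for/while, except handlers,
                    # boolean operators, and conditional expressions.
                    if isinstance(child, (ast.If, ast.For, ast.While,
                                          ast.ExceptHandler,
                                          ast.BoolOp, ast.IfExp)):
                        complexity += 1
                scores[node.name] = complexity
        return scores

    # Hypothetical usage: run per commit (e.g. driven by `git log`) and log the
    # scores alongside your regression/incident data to look for a correlation.
    if __name__ == "__main__":
        for path in sys.argv[1:]:
            with open(path) as f:
                for name, score in approx_cyclomatic_complexity(f.read()).items():
                    print(f"{path}:{name} complexity={score}")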

Comments (1)

dvrp · 18h ago
I think about it from an entropy POV: how much signal is the code/text transmitting?

You can tell it’s AI for a surprisingly large share of LLM output. If it’s regurgitating what you or your team already know, it’s slop.

This gets tricky to apply, of course, but then it’s a tricky question. I don’t think objective metrics work, though (in your case, cyclomatic complexity), because information is relative by nature: what’s slop to one person is high-quality code or new information to someone else.
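One crude way to make the entropy framing concrete, purely as a sketch: treat compressibility as a redundancy proxy, on the theory that repetitive, boilerplate-heavy diffs compress much further than dense ones. The gzip ratio is a stand-in for "signal", not a real measure of information, and the 0.8 cutoff below is made up:

    import sys
    import zlib

    def redundancy_ratio(diff_text: str) -> float:
        """Crude proxy: how compressible is the change? Higher = more repetitive."""
        raw = diff_text.encode("utf-8")
        if not raw:
            return 0.0
        compressed = zlib.compress(raw, level=9)
        return 1 - len(compressed) / len(raw)

    # Hypothetical usage: pipe `git diff` output into this and compare against
    # diffs you already consider dense; the threshold is arbitrary.
    if __name__ == "__main__":
        ratio = redundancy_ratio(sys.stdin.read())
        flag = "looks redundant" if ratio > 0.8 else "ok"
        print(f"redundancy={ratio:.2f} ({flag})")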