The Surprising gRPC Client Bottleneck in Low-Latency Networks

59 points by eivanov89 on 7/23/2025, 1:23:20 PM | 10 comments | blog.ydb.tech

Comments (10)

yuliyp · 5h ago
If you have a single TCP connection, all the data flows through that connection, which ultimately serializes at least some of the processing. Given that the workers are just responding with OK, no matter how many CPU cores you give them you're still bound by the throughput of the IO thread (well, by the minimum of the client and server IO threads). If you want more than one IO thread to share the load, you need more than one TCP connection.
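
A minimal sketch of that workaround in grpc-go (illustrative only; the pool type and helper names below are invented, not from the article): open several ClientConns to the same target so that each gets its own HTTP/2/TCP connection, then spread calls across them.

    package grpcpool

    import (
        "sync/atomic"

        "google.golang.org/grpc"
        "google.golang.org/grpc/credentials/insecure"
    )

    // connPool holds several independent ClientConns to the same target; each
    // one is backed by its own HTTP/2 (and therefore TCP) connection, so their
    // IO paths are not serialized behind a single socket.
    type connPool struct {
        conns []*grpc.ClientConn
        next  uint64
    }

    func newConnPool(target string, size int) (*connPool, error) {
        p := &connPool{conns: make([]*grpc.ClientConn, 0, size)}
        for i := 0; i < size; i++ {
            cc, err := grpc.NewClient(target,
                grpc.WithTransportCredentials(insecure.NewCredentials()))
            if err != nil {
                return nil, err
            }
            p.conns = append(p.conns, cc)
        }
        return p, nil
    }

    // pick returns the next connection in round-robin order; RPC stubs are
    // then created against whichever ClientConn pick returns.
    func (p *connPool) pick() *grpc.ClientConn {
        n := atomic.AddUint64(&p.next, 1)
        return p.conns[n%uint64(len(p.conns))]
    }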
lacop · 2h ago
Somewhat related, I'm running into a gRPC latency issue in https://github.com/grpc/grpc-go/issues/8436

If the request payload exceeds a certain size, the response latency goes from one network RTT to double or triple that.

Definitely something wrong with either TCP or HTTP/2 windowing, as the client doesn't send the full request without first getting an ACK from the server. But neither the gRPC windowing config options nor the Linux tcp_wmem/rmem settings help. Sending a one-byte request every few hundred milliseconds fixes it by keeping the gRPC channel / TCP connection active. Nagle / slow start is disabled.
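
For reference, the kind of grpc-go knobs being described look roughly like this (a sketch with illustrative values: larger HTTP/2 flow-control windows plus keepalive pings instead of hand-rolled one-byte requests):

    package grpctune

    import (
        "time"

        "google.golang.org/grpc"
        "google.golang.org/grpc/credentials/insecure"
        "google.golang.org/grpc/keepalive"
    )

    // dialTuned shows the kind of options mentioned above: larger HTTP/2
    // flow-control windows plus keepalive pings so the connection doesn't go
    // idle between requests. Values are illustrative, not recommendations.
    func dialTuned(target string) (*grpc.ClientConn, error) {
        return grpc.NewClient(target,
            grpc.WithTransportCredentials(insecure.NewCredentials()),
            grpc.WithInitialWindowSize(1<<20),     // per-stream HTTP/2 window (1 MiB)
            grpc.WithInitialConnWindowSize(1<<20), // per-connection HTTP/2 window (1 MiB)
            grpc.WithKeepaliveParams(keepalive.ClientParameters{
                Time:                10 * time.Second, // grpc-go clamps smaller values up to 10s
                PermitWithoutStream: true,             // ping even with no active RPCs
            }),
        )
    }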

littlecranky67 · 2h ago
Sounds like classic TCP congestion window scaling delay; your payload probably exceeds 10x initcwnd.
lacop · 2h ago
Doesn't initcwnd only apply as the initial value? I don't care that the first request on the gRPC channel is slow, but subsequent requests on the same channel reuse the TCP connection and should have a larger window size. This works as long as the channel is actively being used, but after a short period of inactivity (a few hundred ms, unsure exactly) something appears to revert back.
littlecranky67 · 2h ago
Yes, in the case of hot TCP connections, congestion control should not be the issue.
lacop · 1h ago
Yeah, that was my understanding too, hence I filed the bug (actually a duplicate of an older bug that was closed because the poster didn't provide a reproduction).

Still not sure if this is a Linux network configuration issue or a gRPC issue, but something is definitely broken if I can't send a ~1MB request and get a response within roughly network RTT + server processing time.
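
One Linux setting that matches the described symptom and may be worth ruling out (an assumption, not something confirmed in the thread) is net.ipv4.tcp_slow_start_after_idle: with its default value of 1, the kernel shrinks the congestion window back toward initcwnd once a connection has been idle for roughly one RTO. A quick check might look like:

    package main

    import (
        "fmt"
        "os"
        "strings"
    )

    // Prints net.ipv4.tcp_slow_start_after_idle. With the default value of 1,
    // Linux shrinks the congestion window back toward initcwnd once the
    // connection has been idle for about one RTO, which would match the
    // "reverts after a few hundred ms of inactivity" symptom above.
    // This is a hypothetical check, not a confirmed diagnosis from the thread.
    func main() {
        b, err := os.ReadFile("/proc/sys/net/ipv4/tcp_slow_start_after_idle")
        if err != nil {
            fmt.Println("could not read sysctl:", err)
            return
        }
        v := strings.TrimSpace(string(b))
        fmt.Println("net.ipv4.tcp_slow_start_after_idle =", v)
        if v == "1" {
            fmt.Println("cwnd is reset after idle; `sysctl -w net.ipv4.tcp_slow_start_after_idle=0` disables that")
        }
    }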

eivanov89 · 2h ago
That's indeed interesting, thank you for sharing.
xtoilette · 4h ago
classic case of head of line blocking!
yuliyp · 3h ago
I don't think this is head-of-line blocking. That is, it's not like a single slow request causes starvation of other requests. The IO thread for the connection is grabbing and dispatching data to workers as fast as it can. All the requests are uniform, so it's not like one request would be bigger/harder to handle for that thread.
otterley · 2h ago
> First, we checked the number of TCP connections using lsof -i TCP:2137 and found that only a single TCP connection was used regardless of in-flight count.

It's head-of-line blocking. When requests are serialized, the queue will grow as long as the time to service a request is longer than the interval between arriving requests. Queue growth is especially bad when sufficient capacity exists to service the requests in parallel, because the backlog is entirely avoidable.
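
A back-of-the-envelope illustration of that queueing argument (numbers are made up, not measurements from the article): if requests arrive faster than the single serialized path can drain them, the backlog grows without bound even while worker cores sit idle.

    package main

    import "fmt"

    // Illustrative numbers only: if requests arrive every 10µs but the single
    // serialized IO path needs 15µs per request, the queue grows without bound
    // regardless of how many worker cores are idle.
    func main() {
        const (
            arrivalIntervalUs = 10.0 // one request every 10 µs
            serviceTimeUs     = 15.0 // serialized IO path spends 15 µs per request
        )
        arrivalsPerSec := 1e6 / arrivalIntervalUs
        servedPerSec := 1e6 / serviceTimeUs
        fmt.Printf("arrivals/s = %.0f, served/s = %.0f, backlog growth = %.0f req/s\n",
            arrivalsPerSec, servedPerSec, arrivalsPerSec-servedPerSec)
    }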