Meta and Yandex Disclosure: Covert Web-to-App Tracking via Localhost on Android (localmess.github.io)

Back in the day I was sponsoring work on libuinet in order to move enough of the kernel needed for a security appliance to libuinet to underpin a performance improvement for pfsense.

Then Tencent did what they did, Patrick reacted as he did and that was over.

We pivoted to VPP. But back in 2016 it also needed a lot of work.

Eduard · 38m ago

Hacker News is the kind of place where you can have _this_ submission (PRC-sponsored Tencent-owned network devkit) on the front page next to a submission about how PRC-sponsored cybercrime group Salt Typhoon pwned 'nearly every American': https://news.ycombinator.com/item?id=45074157

ramesh31 · 3h ago

Thanks for the F-Stack!

alexdns · 7h ago

It was considered innovative when it was first shared here eight years ago.

nurumaik · 6h ago

Anything more innovative happened since (honestly curious)?

yxhuvud · 1h ago

Well, io_uring came along and removed a lot of the incentive.

js4ever · 5h ago

I don't think so, but my guess is raw performance rarely matters in the real world.

I once explored this, hitting around 125K RPS per core on Node.js. Then I realized it was pointless, the moment you add any real work (database calls, file I/O, etc.), throughput drops below 10K RPS.

rivetfasten · 4h ago

It's always a matter of chasing the bottleneck. It's fair to say that network isn't the bottleneck for most applications. Heuristically, if you're willing to take on the performance impacts of a GC'd language you're probably already not the target audience.

Zero copy is the important part for applications that need to saturate the NIC. For example Netflix integrated encryption into the FreeBSD kernel so they could use sendfile for zero-copy transfers from SSD (in the case of very popular titles) to a TLS stream. Otherwise they would have had two extra copies of every block of video just to encrypt it.

Note however that their actual streaming stack is very different from the application stack. The constraint isn't strictly technical: ISP colocation space is expensive, so they need to have the most juiced machines they can possibly fit in the rack to control costs.

There's an obvious appeal to accomplishing zero-copy by pushing network functionality into user space instead of application functionality into kernel space, so the DPDK evolution is natural.

pclmulqdq · 5m ago

TCP is generally zero-copy now. Zero-copy with io_uring is also possible.

AF_XDP is also another way to do high-performance networking in the kernel, and it's not bad.

DPDK still has a ~30% advantage over an optimized kernel-space application with a huge maintenance burden. A lot of people reach for it, though, without optimizing kernel interfaces first.

antoinealb · 5h ago

The goal of this kind of system is not to replace the application server. This is intended to work on the data plane where you do simple operations but do them many time per second. Think things like load balancers, cache server, routers, security appliances, etc. In this space Kernel Bypass is still very much the norm if you want to get an efficient system.

baruch · 3h ago

We do storage systems and use DPDK in the application, when the network IS the bottleneck it is worth it. Saturating two or three 400gbps NICs is possible with DPDK and the right architecture that makes the network be the bottleneck.

eqvinox · 4h ago

> In this space Kernel Bypass is still very much the norm if you want to get an efficient system.

Unless you can get an ASIC to do it, then the ASIC is massively preferrable; just the power savings generally¹ end the discussion. (= remove most routers from the list; also some security appliances and load balancers.)

¹ exceptions confirm the rule, i.e. small/boutique setups

gonzopancho · 1h ago

ASICs require years to develop and aren’t flexible once deployed

nsteel · 33m ago

Even the ones supporting things like P4?

jandrewrogers · 5h ago

Storage and database doesn’t have to be that slow, that’s just architecture. I have database servers doing 10M RPS each, which absolutely will stress the network.

We just do the networking bits a bit differently now. DPDK was a product of its time.

klaussilveira · 5h ago

https://asynciobench.github.io/

ozgrakkurt · 5h ago

You can apparently do 100gbit/sec on a single thread over ethernet with io uring.

touisteur · 4h ago

Recently did 400gb/s on a single core / 4x100gb nics (or just the one 400g nic, too) with dpdk. Mind you it's with jumbo frames and constant packet size for hundreds of mostly synchronized streams... You won't process each packet individually, mostly put them in queues for later batch-process by other cores. Amazing for data acquisition applications using UDP streams.

I keep watching and trying io_uring and still can't make it work as fast with simple code as consistently for those use cases. AF_XDP gets me partly there but then you're writing ebpf... might as well go full-dpdk.

Maybe it's a skill issue on my part, though. Or just a well-fitting niche.

ozgrakkurt · 1h ago

Sounds super cool but dpdk sounds like it won't be worth the difficulty from what I read so far.

I also want to get into socket io using io_uring in zig. I'll try to apply everything I found in liburing wiki [0] and see how much I can get (max hardware I have is 10gbit/s).

Seems like there is: - multi-shot requests - register_napi on uring instance - zero copy receive/send. (Probably won't be able to get into it)

Did you already try these or are there other configurations I can add to improve it?

[0]: https://github.com/axboe/liburing/wiki/io_uring-and-networki...

lossolo · 16m ago

You don't even need io_uring for 10 gbit/s, epoll will do that easily, unless you have very niche workload.

lossolo · 3h ago

Any numbers for io_uring with 4x100gb nics in your tests?

Kamillaova · 3h ago

Of course, but when working directly with the NIC, such speeds can be achieved with smaller packets, getting even closer to the linerate.

Are we decentralized yet? (arewedecentralizedyet.online)

Cognitive load is what matters (github.com)

The Rise of Hybrid PHP: Blending PHP with Go and Rust (yekdeveloper.com)

Can cheaper lasers handle short distances? (semiengineering.com)

Agent Client Protocol (ACP) (agentclientprotocol.com)

AI models need a virtual machine (blog.sigplan.org)

Do the simplest thing that could possibly work (seangoedecke.com)

Condor's Cuzco RISC-V Core at Hot Chips 2025 (chipsandcheese.com)

Bi-directional accountability: A leadership shift most organizations avoid (alnewkirk.com)

SQL Design Patterns (vadimtropashko.wordpress.com)

Pig lung transplanted into a human (sciencealert.com)

U.S. guided-missile cruiser crosses Panama Canal, warships deployed to Venezuela (cbsnews.com)

From multi-head to latent attention: The evolution of attention mechanisms (vinithavn.medium.com)

You Have to Feel It (mitchellh.com)

Show HN: Hacker News em dash user leaderboard pre-ChatGPT (gally.net)

Show HN: I made an Animal Crossing style letter editor (acmail.idreesinc.com)

Hardening Firefox – a checklist for improved browser privacy (andrewmarder.net)

The space race is transforming Southern California's economy – again (latimes.com)

Nokia’s legendary font makes for a great user interface font (osnews.com)

Sleeper AI agents and how Anthropic detects them [video] (youtube.com)

GAO warns of privacy risks in using facial recognition in rental housing (files.gao.gov)

John Carmack's arguments against building a custom XR OS at Meta (twitter.com)

Meta and Yandex Disclosure: Covert Web-to-App Tracking via Localhost on Android (localmess.github.io)

Lisp from Nothing, Second Edition (t3x.org)

University of Cambridge Cognitive Ability Test (planning.e-psychometrics.com)

FBI cyber cop: Salt Typhoon pwned 'nearly every American' (theregister.com)

F-Stack – A network development kit with high performance based on DPDK (f-stack.org)

Bcachefs Goes to "Externally Maintained" (lwn.net)

De minimis exemption ends (washingtonpost.com)

Essential Coding Theory [pdf] (cse.buffalo.edu)

How to design a DBMS for Telco requirements (mikaelronstrom.blogspot.com)

The Theoretical Limitations of Embedding-Based Retrieval (arxiv.org)

The Grammar According to West (dwest.web.illinois.edu)

Deploying DeepSeek on 96 H100 GPUs (lmsys.org)

Why did dlclose not unload the library? (2023) (kishoreganesh.com)

Taylor Otwell: What 14 Years of Laravel Taught Me About Maintainability (maintainable.fm)

Hermes 4 (hermes4.nousresearch.com)

Wikipedia as a Graph (wikigrapher.com)

The Delusion Machine – What happened when I fed my soul into an LLM (hedgehogreview.com)

How do I get into the game industry (garry.net)

Why Romania excels in international Olympiads (palladiummag.com)

Emulating aarch64 in software using JIT compilation and Rust (pitsidianak.is)

Income Equality in Nordic Countries: Myths, Facts, and Lessons (aeaweb.org)

The web does not need gatekeepers: Cloudflare’s new “signed agents” pitch (positiveblue.substack.com)

Two mystery customers alone responsible for ~40% of Nvidia's quarterly revenue (fortune.com)

Thunder Compute (YC S24) Is Hiring (ycombinator.com)

Trying to get error backtraces in Rust libraries right (iroh.computer)

I Love Reading 1980s Computer Magazines, and So Should You (2021) (wired.com)

God created the real numbers (ethanheilman.com)

Show HN: Sosumi.ai – Convert Apple Developer docs to AI-readable Markdown (sosumi.ai)

F-Stack – A network development kit with high performance based on DPDK

Comments (22)