Branch Privilege Injection: Exploiting branch predictor race conditions

244 alberto-m 89 5/13/2025, 4:44:51 PM comsec.ethz.ch ↗

Comments (89)

HeliumHydride · 2h ago
https://scholar.harvard.edu/files/mickens/files/theslowwinte...

"Unfortunately for John, the branches made a pact with Satan and quantum mechanics [...] In exchange for their last remaining bits of entropy, the branches cast evil spells on future generations of processors. Those evil spells had names like “scaling-induced voltage leaks” and “increasing levels of waste heat” [...] the branches, those vanquished foes from long ago, would have the last laugh."

Hackbraten · 2h ago
I love James Mickens!

https://www.usenix.org/system/files/1401_08-12_mickens.pdf

> The Mossad is not intimidated by the fact that you employ https://. If the Mossad wants your data, they’re going to use a drone to replace your cellphone with a piece of uranium that’s shaped like a cellphone, and when you die of tumors filled with tumors, […] they’re going to buy all of your stuff at your estate sale so that they can directly look at the photos of your vacation instead of reading your insipid emails about them.

wood_spirit · 1h ago
So this is where they got the pager and walkie talkie ideas from
bee_rider · 2h ago
The bit about vast matrices shows some silver lining though; it turns out John’s little brother figured out how to teach those matrices to talk like a person.
yvdriess · 10m ago
Yes but those transistors moved to greener pastures.
progval · 4h ago
dang · 3h ago
Thanks! We've changed the URL above from the university press release (https://ethz.ch/en/news-and-events/eth-news/news/2025/05/eth...) to that first link.
ncr100 · 3h ago
Impact illustration:

> [...] the contents of the entire memory to be read over time, explains Rüegge. “We can trigger the error repeatedly and achieve a readout speed of over 5000 bytes per second.” In the event of an attack, therefore, it is only a matter of time before the information in the entire CPU memory falls into the wrong hands.

formerly_proven · 3h ago
Prepare for another dive maneuver in the benchmarks department I guess.
cenamus · 3h ago
And if not, why did they introduce severe bugs for a tiny performance improvement?
bloppe · 3h ago
It's not tiny. Speculative execution usually makes code run 10-50% faster, depending on how many branches there are
bee_rider · 3h ago
Yeah… folks who think this is just some easy to avoid thing should go look around and find the processor without branch prediction that they want to use.

On the bright side, they will get to enjoy a much better music scene, because they’ll be visiting the 90’s.

titzer · 13m ago
That's a vast underestimate. Putting in lfence before every branch is on the order of 10X slowdown.
trebligdivad · 3h ago
Thanks! It would be great if someone could update the title URL to that blog post; the press release is worse than useless.
dang · 3h ago
trebligdivad · 2h ago
Thanks!
mettamage · 4h ago
Good to see Kaveh Razavi, he used to teach at my uni in the Vrije Universiteit in Amsterdam :) The course Hardware Security was crazy cool and delved into stuff lijke this.
markus_zhang · 3h ago
I checked out this course (and another one from Vrije about malware) a couple of years ago, back then there was very little public info about the courses.

Do you know if there is any official recording or notes online?

Thanks in advance.

thijsr · 35m ago
As far as I am aware, the course material is not public. Practical assignments are an integral part of the courses given by the VUSEC group, and unfortunately those are difficult to do remotely without the course infrastructure.

The Binary and Malware Analysis course that you mentioned builds on top of the book "Practical Binary Analysis" by Dennis Andriesse, so you could grab a copy of that if you are interested.

mettamage · 18m ago
Ah yea, he gave a guest lecture on how he hacked a botnet!

More info here: https://krebsonsecurity.com/2014/06/operation-tovar-targets-...

it's been a while back :)

mettamage · 20m ago
No, but last time I checked you can be a contracted student for 1200 euro's.

If I knew what I was getting into at the time, I'd do it. I did pay for extra, but in my case it was the low Dutch rate, so for me it was 400 euro's to follow hardware security, since I already graduated.

But I can give a rough outline of what they taught. It has been years ago but here you go.

Hardware security:

* Flush/Reload

* Cache eviction

* Spectre

* Rowhammer

* Implement research paper

* Read all kinds of research papers of our choosing (just use VUSEC as your seed and you'll be good to go)

Binary & Malware Analysis:

* Using IDA Pro to find the exact assembly line where the unpacker software we had to analyze unpacked its software fully into memory. Also we had to disable GDB debug protections. Something to do with ptrace and nopping some instructions out, if I recall correctly (look, I only low level programmed in my security courses and it was years ago - I'm a bit flabbergasted I remember the rough course outlines relatively well).

* Being able to dump the unpacked binary program from memory onto disk. Understanding page alignment was rough. Because even if you got it, there were a few gotcha's. I've looked at so many hexdumps it was insane.

* Taint analysis: watching user input "taint" other variables

* Instrumenting a binary with Intel PIN

* Cracking some program with Triton. I think Triton helped to instrument your binary with the help of Intel PIN by putting certain things (like xor's) into an SMT equation or something and you had this SMT/Z3 solver thingy and then you cracked it. I don't remember got a 6 out of 10 for this assignment, had a hard time cracking the real thing.

Computer & Network Security:

* Web securtiy: think XSS, CSRF, SQLi and reflected SQLi

* Application security: see binary and malware analysis

* Network security: we had to create our own packet sniffer and we enacted a Kevin Mitnick attack (it's an old school one) where we had to spoof our IP addresses, figure out the algorithm to create TCP packet numbers - all in the blind without feedback. Kevin in '97 I believe attacked the San Diego super computer (might be wrong about the details here). He noticed that the super computer S trusted a specific computer T. So the assignment was to spoof the address of T and pretend we were sending packets from that location. I think... writing this packet sniffer was my first C program. My prof. thought I was crazy that this was my first time writing C. I was, I also had 80 hours of time and motivation per week. So that helped.

* Finding vulnerabilities in C programs. I remember: stack overflows, heap overflows and format strings bugs.

-----

For binary & malware analsys + computer & network security I highly recommend hackthebox.eu

For hardware security, I haven't seen an alternative. To be fair, I'm not looking. I like to dive deep into security for a few months out of the year and then I can't stand it for a while.

rakingleaves · 1h ago
Anyone know how this relates to the Training Solo attack that was just disclosed? https://www.vusec.net/projects/training-solo/
rini17 · 3h ago
If CPU brach predictor had bits of information readily available to check buffer boundaries and privilege level of the code, all this would be much easier to prevent. But apparently that will only happen when we pry out the void* from the cold C programmers' hands and start enriching our pointers with vital information.
ActorNightly · 1h ago
Or people could just understand the scope of the issue better, and realize that just because something has a vulnerability doesn't mean there is a direct line to an attack.

In the case of speculative execution, you need an insane amount of prep to use that exploit to actually do something. The only real way this could ever be used is if you have direct access to the computer where you can run low level code. Its not like you can write JS code with this that runs on browsers that lets you leak arbitrary secrets.

And in the case of systems that are valuable enough to exploit with a risk of a dedicated private or state funded group doing the necessary research and targeting, there should be a system that doesn't allow unauthorized arbitrary code to run in the first place.

I personally disable all the mitigations because performance boost is actually noticeable.

vlovich123 · 1h ago
> Its not like you can write JS code with this that runs on browsers that lets you leak arbitrary secrets

That's precisely what Spectre and Meltdown were though. It's unclear whether this attack would work in modern browsers but they did reenable SharedArrayBuffer & it's unclear if the existing mitigations for Spectre/Meltdown stimy this attack.

> I personally disable all the mitigations because performance boost is actually noticeable.

Congratulations, you are probably susceptible to JS code reading crypto keys on your machine.

quotemstr · 2h ago
You want CHERI.
ajross · 3h ago
I don't see how you think that will help? It's not about software abstraction, it's about hardware. Changing the "pointer" does nothing to the transistors.

Doing what you want would essentially require a hardware architecture where every load/store has to go through some kind of "augmented address" that stores boundary information.

Which is to say, you're asking for 80286 segmentation. We had that, it didn't do what you wanted. And the reason is that those segment descriptors need to be loaded by software that doesn't mess things up. And it doesn't, it's "just a pointer" to software and amenable to the same mistakes.

rini17 · 1h ago
286 far pointers were used sparingly, to save precious memory. Now we don't have any such problem and there are still unused bits in pointers even on largest 64 bit systems that might be repurposed perhaps. With virtual memory, there are all kinds of hardware supported address mappings and translations and IOMMU already so adding more transistors isn't an issue. The issue is purely cultural as you have just shown, people can't imagine it.
ajross · 55m ago
That's misunderstanding the hardware. All memory access on a 286 was through a segment descriptor, every access done in protected mode was checked against the segment limit. Every single one.

A "far pointer" was, again, a *software* concept where you could tell the compiler that this particular pointer needed to use a different descriptor than the one the toolchain assumed (by convention!) was loaded in DS or SS.

nine_k · 2h ago
Why stop at 80286, consider going back to the ideas of iAPX432, but with modern silicon tech and the ability to spend a few million transistors here and there.

(CHERI already exists on ARM and RISC-V though.)

nottorp · 3h ago
I suppose a CPU that only runs Rust p-code is what the OP is dreaming about...
ajross · 3h ago
Generated rust "p-code" would presumably be isomorphic to LLVM IR, which doesn't have this behavior either and would be subject to the same exploits.

Again, it's just not a software problem. In the real world we have hardware that exposes "memory" to running instructions as a linear array of numbers with sequential addresses. As long as that's how it works, you can demand an out of bounds address (because the "bounds" are a semantic thing and not a hardware thing).

It is possible to change that basic design principle (again, x86 segmentation being a good example), but it's a whole lot more involved than just "Rust Will Fix All The Things".

nottorp · 3h ago
Holy... I need to stop making fun of Rust (*). I keep getting misinterpreted.

(*) ... although I don't think I can abstain ...

smartmic · 3h ago
> Closing these sorts of gaps requires a special update to the processor’s microcode. This can be done via a BIOS or operating system update and should therefore be installed on our PCs in one of the latest cumulative updates from Windows.

Why mention only Windows, what about Linux users?

matja · 2h ago
The Linux kernel has had microcode loading support (`CONFIG_MICROCODE` / `CONFIG_MICROCODE_INTEL`) but many years, but it does require that Intel release the microcode files necessary for distribution maintainers to update the packages, then it should be included in a system update.
ajross · 2h ago
Intel distributes microcode updates for Linux here: https://github.com/intel/Intel-Linux-Processor-Microcode-Dat... , and the distro are all set up to pull from there and distribute automatically.

Not expert enough to know what to look for to see if these particular mitigations are present yet.

rtkwe · 4h ago
I wonder if there's similar gaps in AMD hardware? Seems like speculative execution is simply an extremely hard to patch vulnerability in a share processor space so I wonder how AMD has avoided it.
tmoertel · 4h ago
According to the authors' blog post:

> Does Branch Privilege Injection affect non-Intel CPUs?

> No. Our analysis has not found any issues on the evaluated AMD and ARM systems.

Source: https://comsec.ethz.ch/research/microarch/branch-privilege-i...

pdpi · 3h ago
The short of it is that AMD haven’t “avoided it”. Speculative execution side channels aren’t one vulnerability but rather a whole family of vulnerabilities. This particular one is (apparently) Intel-only, same as Meltdown was, but AMD was also vulnerable to the original Spectre.
bee_rider · 3h ago
Pedantically, speculative execution isn’t the vulnerability, it is a necessary mechanism for every high-performance CPU nowadays (where “nowadays” started, like, around the turn of the century). However, bugs and vulnerabilities in speculative execution engines are very widespread because they are complicated.

There are probably similar bugs in AMD and ARM, I mean how long did these bugs sit undiscovered in Intel, right?

Unfortunately the only real fix is to recognize that you can’t isolate code running on a modern system, which would be devastating to some really rich companies’ business models.

quotemstr · 2h ago
The solution to this particular vulnerability is intuitive to me: snapshot the current privilege level when we enqueue a branch predictor update and carry that snapshot along with the update itself as it flows through the processor's internal buffers. Same problem you might have in software and the same solution, yes?
The28thDuck · 47m ago
Haven’t we been here before? It seems like it’s very similar to the branch prediction exploits of the late 2010s. Is there something particularly novel about this class of exploits?
mettamage · 14m ago
Probably, I haven't had time to delve into the article yet. But ever I first learned about them I got the hunch that they'd never fully go away.

Then people say "no that's not possible, we got security in place."

So then the researchers showcase a new demo where they use their existing knowledge with the same issue (i.e. scaling-induced voltage leaks).

I suspect this will go on and on for decades to come.

yonatan8070 · 3h ago
Just to make sure I got this right, at this point in time there are patches out for all major operating systems that can mitigate this/apply relevant microcode to mitigate it?
201984 · 2h ago

  mitigations=off
Don't care.
matja · 2h ago
"Don't mind me running this piece of WASM in a webworker to collect all the useful encryption keys and cookies in your RAM..."
201984 · 27m ago
Has even a single web exploit ever been found in the wild? Until then, I'm not going to worry and probably not even then.
bee_rider · 1h ago
Yeah, he should really turn mitigations on, so that when running arbitrary code from the internet he can be subject to 9999 vulnerabilities, instead of 10,000.
darkmighty · 1h ago
There are many kinds of vulnerabilities. Most are pretty mundane afaict. Breaking sandboxes and reading out your entire RAM is basically game over, existential vulnerability (second only to arbitrary code execution, though it can give you SSH keys I guess).

The mitigating factor is actually that you don't go to malicious websites all the time, hopefully. But it happens, including with injected code on ads and stuff that may enabled by secondary vulnerabilities.

johnnyjeans · 1h ago
Uncaught ReferenceError: WebAssembly is not defined
vlovich123 · 1h ago
You don't need WASM to deploy Spectre/Meltdown. Vanilla JS works just fine which is what was demonstrated in the original paper.
layer8 · 3h ago
dzdt · 3h ago
The end-user processor slowdowns from Spectre and Meltdown mitigations were fairly substantial. Has anyone seen an estimate of how much the microcode updates for this new speculative vulnerability are going to cost in terms of slowdown?
leonidasv · 3h ago
> Our performance evaluation shows up to 2.7% overhead for the microcode mitigation on Alder Lake. We have also evaluated several potential alternative mitigation strategies in software with overheads between 1.6% (Coffee Lake Refresh) and 8.3% (Rocket lake)

https://comsec.ethz.ch/research/microarch/branch-privilege-i...

dzdt · 3h ago
Thanks, missed that! I remember seeing benchmarks showing like 15% slowdown from Spectre/Meltdown mitigations, so this is not as bad as that, but that is on top of the other too I guess...
margorczynski · 3h ago
I wonder if there's any way to recover for Intel. They don't have anything worthwhile on the market, R&D takes a lot of time and their foundries are a constant source of losses as they're inferior compared to the competition.

On top of that x86 seems to be pushed out more and more by ARM hardware and now increasingly RISC-V from China. But of course there's the US chip angle - will the US, especially after the problems during Covid, let a key manufacturer like Intel bite the dust?

chneu · 3h ago
Intel really isn't in as much trouble as tech blogs like to act.

It's not great but lol the sensationalism is hilarious.

Remember, gamers only make up a few percentage of users for what Intel makes. But that's what you hear about the most. One or two data center orders are larger than all the gaming cpus Intel will sell in a year. And Intel is still doing fine in the data center market.

Add in that Intel still dominates the business laptop market which is, again, larger than the gamer market by a pretty wide margin.

WaxProlix · 3h ago
You're right about gamers, but other verticals are looking bad for Intel, too.

The two areas you mention (data center, integrated OEM/mobile) are the two that are most supply chain and business-lead dependent. They center around reliable deliveries of capable products at scale, hardware certifications, IT department training, and organizational bureaucracy that Intel has had captured for a long time.

But!

Data center specifically is getting hit hard from AMD in the x86 world and ARM on the other side. AWS's move to Graviton alone represents a massive dip in Intel market share, and it's not the only game in town.

Apple is continuing to succeed in the professional workspace, and AMD's share of laptop and OEM contracts just keeps going up. Once an IT department or their chosen vendor has retooled to support non-Intel, that toothpaste is not going back into the tube - not fully, at least.

For both of these, AMD's improvement in reliability and delivery at scale will be bearing fruit for the next decade (at Intel's expense), and the mindshare, which gamers and tech sensationalism are indicators for, has already shifted the market away from an Intel-dominated world to a much more competitive one. Intel will have to truly compete in that market. Intel has stayed competitive in a price-to-performance sense by undermining their own bottom line, but that lever only has so far it can be pulled.

So I'm not super bullish on Intel, sensationalism aside. They have a ton of momentum, but will need to make use of it ASAP, and they haven't shown an ability to do that so far.

layer8 · 3h ago
Intel still has well over 70% x86 market share. They have a long runway. Arm had only 15% datacenter market share last year, and still hasn’t made much headway in the Windows market.
freeone3000 · 2h ago
Arm is making huge gains though — five years ago they had less than 5%. The future of x86 is not bright.
baq · 2h ago
x86 vs arm doesn’t matter. Hardware matters. Intel needs to make the best cpu again. It can be x86, it can be arm, it can be risc-v.
adgjlsfhk1 · 1h ago
Arm vs x86 matters a lot for Intel since they don't make Arm CPUs. x86 used to be a massive moat for Intel/AMD. The rise of ARM market-share means that that moat is draining. 10 years ago, AMD and IBM were the only competition (and they were both in rough shape). Now Intel is competing against AMD, NVidia, Qualcom, Amazon, and Arm. Even if Intel can make the best CPU again, they no longer can charge monopoly prices for it. If you have a 10% faster CPU, that only lets you charge a small premium over everyone else.
emkoemko · 3h ago
didn't i read something about apple,nvidia and other companies looking to use their foundries? why would they do that if its inferior or was that something else?
greenavocado · 3h ago
Because there's nothing else in America
porridgeraisin · 3h ago
I guess it depends on your expectations. Will they be fine as a company? I think yes. Will they be as prominent as they were at different points in their history? I think not.

Product aside, from a shareholder/business point of view (I like to think of this separately these days as financial performance is becoming less and less reflective of the end product) I think they are too big to fail.

gitroom · 1h ago
yeah this just makes me wanna see real world numbers on the slowdown, cuz honestly all these microcode fixes feel like trading off years of speed for maybe a little more peace of mind - you ever think well actually move off this cycle or is it just here to stay?
tannhaeuser · 3h ago
> All intel processors since the 9th generation (Coffee Lake Refresh) are affected by Branch Privilege Injection. However, we have observed predictions bypassing the Indirect Branch Prediction Barrier (IBPB) on processors as far back as 7th generation (Kaby Lake).

From that piece of text on the blog, I don‘t quite unterstand if Kaby Lake CPUs are affected or not.

chrisweekly · 3h ago
I interpret it as including Kaby Lake.
fwip · 2h ago
At least some Kaby Lake CPUs are affected, but they can't say for sure that all of them are.
lostmsu · 3m ago
No, I think they are saying that they can only demonstrate exploit on Coffee Lake Refresh and later, but the issue that let them create exploit exists all the way back to Kaby Lake. So they are also probably exploitable, but this specific exploit does not target them.
j45 · 3h ago
Since the cloud is someone else's computer, and someone else's shared CPU, is cloud hosting (including vps) potentially impacted?

Look forward to learning how this can be meaningfully mitigated.

matja · 2h ago
For reads across different VMs on the same CPU, theoretically TME-MK could mitigate the usefulness of the memory reads by having each VM access memory using a different memory encryption key, but I don't know of any hypervisors that implement this.

AMD has had SEV support in QEMU for a long time, which some cloud hosting providers use already, that would mitigate any such issue if it occurred on AMD EPYC processors.

andrewla · 3h ago
Intel claims [1] that they already have microcode mitigation. Like Spectre and Meltdown this is likely to have performance implications.

[1] https://www.intel.com/content/www/us/en/security-center/advi...

j45 · 2h ago
Spectre and Meltdown had some pretty big performance hits in the beginning. Wonder how much it will differ here in real world, third party (and independent) testing.
whatever1 · 3h ago
It’s dead, can you please stop stubbing it?
anonymars · 3h ago
I thought I understand these words, yet I don't understand what you mean
arghwhat · 4h ago
> On an up to date Ubuntu 24.04

So not very up to date, but I suppose mitigations haven't changed significantly upstream since then.

necubi · 4h ago
24.04 is the most recent LTS (long term support) release; it's what users are meant to be running for anything important
arghwhat · 39m ago
My point is that it is not representative of the current state of the kernel.

The kernel has nothing to do with Ubuntu, its release schedule and LTS's. Distro LTS releases also often mean custom kernels, backports, hardware enablement, whatnot, which makes it a fork, so unless were analyzing Ubuntu security rather than Linux security, mainline should be used.

FirmwareBurner · 3h ago
>it's what users are meant to be running for anything important

Anything important requires TempleOS.

thomasdziedzic · 4h ago
That version is significant because it is the latest LTS release. Most servers use LTS releases.
blueflow · 4h ago
Ubuntu 24.04 is the current LTS release. Our are you intending to say that Ubuntu, regardless of version, is not up to date?

Edit: "LTS" added due to popular demand

arghwhat · 36m ago
I am saying that any version of Ubuntu is not representative of the mainline kernel, which is what is relevant when it comes to analyzing current mitigations.

Distro LTS releases often mean custom kernels, backports, hardware enablement, whatnot, which makes it effectively a fork.

Unless were interested in discovering kernel variation discrepancies, its more interesting to analyze mainline.

pdpi · 3h ago
You need a qualifier there — the latest Ubuntu release is 25.04, but 24.04 is the current LTS release.
razemio · 3h ago
It is up to date, with security patches and fixes. That is obviously what is relevant here. That is why the parent comment got down voted, since it is up to date in context of a security vulnerability. It should be even more secure, since new software versions might introduce unknown attack vectors.
7bit · 3h ago
There is a difference between an up2date Ubuntu 24.04 and an up2date Ubuntu.

And as security updates are back ported to all supported versions - and 24.04 being an LTS release, it is as up2date as it gets.

If you're being pedantic, be the right kind of pedantic ;)

arghwhat · 25m ago
The problem is that it's downstream backports and hardware enablement - you're running an old forked artisinal kernel maintained by Canonical, you will only get bugfixes if known to be severe enough to be flagged, and all this patching deviates it from mainline and can itself introduce new security vulnerabilities not present in mainline.

This differs from an actual later release which is closer to mainline and includes all newer fixes, including ones that are important but weren't flagged, and with less risk of having new downstream bugs.

If you're going to fight pedantism by being pedantic, better be the right kind of pedantic. ;)

fwip · 3h ago
24.04 is an LTS (long term support) release, so it receives updates, including security updates, for much longer than a regular release. I believe it's a 5-year support window, and longer if you shell out for paid support.
arghwhat · 29m ago
These updates mean that you are no longer running a mainline kernel, but an Ubuntu fork with whatever backports and hardware enablement (and new bugs!) this might introduce. This is also true for other software.

LTS does not mean you get all updates, it only means you get to drag your feet for longer with random bugfixes. Only the latest release has updates.