Booting the RP2350 from UART

74 hugolundin 12 5/11/2025, 2:26:49 PM pfister.dev ↗

Comments (12)

vardump · 16h ago
One could also send a binary stub that sets up fast CPU clock speed and decompresses the rest of the firmware at the RP2350 side. Should be even faster.

Just like old C64 decrunchers and Amiga PowerPacker. Or Fabrice Bellard's LZEXE. (Is there anything that guy did NOT write?!)

zoobab · 2h ago
The CH32V003 has also a UART bootloader, but for some reason there is no open source command line client to do something with it. WCH has a Windows GUI though.
bluehex · 8h ago
This is awesome. I've had similar ideas but wasn't able to do any prototyping yet as I only have Pico 2 boards that don't expose the CSn pin in the pinout.

Rather than UART booting every time I thought it might be nice to use UART Boot just as a way to deliver the firmware update to the sub chip - so the UART image you load would just be a program that accepts a larger image (over UART again) and would write to the flash for subsequent boots. I think that would get around the SRAM and boot time downsides the author mentioned. Is there a reason this might not work?

vardump · 1h ago
That requires having a flash chip in the first place. By booting via UART you don't need any flash at all.
duskwuff · 15h ago
In principle, you could boot the RP2040 over SWD. It'd be much more difficult to code, but the possibility is there...
gadgetoid · 3h ago
We (Pimoroni) actually shipped this technique in PicoVision, used to load the “GPU” firmware (an RP2040 used to offload the HDMI signal generation) at runtime-

https://github.com/pimoroni/picovision/blob/main/drivers/dv_...

flyingcircus3 · 12h ago
Are you implying the SWD signals would send the RAM contents every time? If I had to do that, I would first use a logic analyzer like Saleae to capture the SWD signals of a JLink performing the necessary operations to load the image into RAM. Then figure out, from the bytes that get send and received, whatever needs to be parameterized, and where to put the image data itself, perhaps by capturing different scenarios, and seeing what changes. Maybe even look up the SWD spec. You would also need to figure out what kind of back and forth is necessary, what must block waiting for a response. From there, assuming there isn't cryptography involved, it just becomes a matter of providing bytes to a bus in the correct order or timing based on the proper events. Some of those bytes are "canned" and never change. Some of them are parameters that describe some important quantity relevant your specific image. And the rest are your firmware image, probably chunked up with some overhead wrapped around it. I allow for the possibility that SWD is far more complex than I imagine, but this approach works pretty well for figuring out whats going on with SPI or I2C or BLE.
duskwuff · 11h ago
SWD and the associated debug interfaces are all documented by ARM; there's no need to reverse-engineer anything here. See the ADIv5 documentation [1] for a starter.

[1]: https://developer.arm.com/documentation/ihi0031/a

dmitrygr · 4h ago
ADIv6 for RP2350 (!important)
bsder · 11h ago
> I allow for the possibility that SWD is far more complex than I imagine, but this approach works pretty well for figuring out whats going on with SPI or I2C or BLE.

SWD is pretty well documented. I won't claim its simple, but, in my opinion, it's decent at what it does. The RISC-V folks haven't seemed to be able to do better (and, IMO, did quite a bit worse in a few places, actually).

The SWD description at the packet/command level: https://arm-software.github.io/CMSIS-DAP/latest/index.html

There is open source code directly from ARM for it: https://github.com/ARMmbed/DAPLink/tree/main/source/daplink/...

The documentation of the actual wire protocol is also extensive, but a little more scattered: https://developer.arm.com/documentation/ihi0031/a?lang=en https://community.nxp.com/pwmxy87654/attachments/pwmxy87654/...

The big problem with the SWD wire protocol ARM documentation (and everybody who copies it) is that they don't point out the fact that when you go from Write-to-Read the active edge of the clock changes. In SPI-speak, you switch from CPHA=1 to CPHA=0. This makes sense if you stop to think about it for a moment because during debug there is no clock. Consequently, SWD must provide the clock and you switch from "put something on DATA a half phase early->pulse clock to make chip do something with it" to "pulse clock which makes chip put something on Data->read it a half phase later". However, if it has never been pointed out to you before, it's likely to trip you up.

Sigrok (or similar) which can decode SWD properly and a digital signal analyzer (even a cheap $10 one) are your friends.

The only diagrams which seem to resemble scope traces that point this out are on obscure Chinese engineering blogs.

mrheosuper · 9h ago
this is also how some BLE controller boot.
mschuster91 · 13h ago
There's nothing speaking "version 1.0" more than a bunch of stuff just manually soldered as piggyback over other components of the board :D

Thanks for the writeup.