The Reggae Museum Foundation: Where Reggae and Dancehall History Lives (thereggaemuseumfoundation.org)

Hey all, As a follow up to my relatively successful series in x86 Assembly of last year[1], I started making an OS that fits in a boot sector. I am purposefully not doing chain loading or multi-stage to see how much I can squeeze out of 510bytes.

It comes with a file system, a shell, and a simple process management. Enough to write non-trivial guest applications, like a text editor and even some games. It's a lot of fun!

It comes with an SDK and you can play around with it in the browser to see what it looks like.

The aim is, as always, to make Assembly less scary and this time around also OS development.

[1]: https://news.ycombinator.com/item?id=41571971

Comments (32)

nathell · 72d ago

Some related stuff:

In 2004, Gavin Barraclough’s mini-OS [0] won the IOCCC, packing a 32-bit multitasking operating system for x86 computers, with GUI and filesystem, support for loading and executing user applications in ELF binary format, with PS/2 mouse and keyboard drivers, VESA graphics, a command shell, and an application into 3.5 KB of highly obfuscated C code.

In 2021, Justine Tunney wrote SectorLISP [1], a Lisp implementation that fits into a bootsector and is able to run McCarthy’s metacircular evaluator.

[0]: https://www.ioccc.org/2004/gavin/index.html [1]: https://github.com/jart/sectorlisp

90s_dev · 72d ago

Two questions:

1. I just saw how str_print is implemented. It's so short even though it's asm. Is this why nul-terminated strings were so popular and became the default in C? Would pascal strings be much longer/uglier/harder in asm?

2. Why is str_print repeated in multiple files? How would you do code sharing in asm? I assume str_print is currently not "static" and you'd have to make it so via linking or something, and then be able to get its address using an asm macro or something?

shikaan · 72d ago

1. If you look through the commit history, you'll see that the first implementation was actually with Pascal strings.

Printing with Pascal strings is actually shorter (you skip the null test, basically), but constructing Pascal strings to pass as an argument when they are not constants yielded much more code to prepare for that call. Had I had more leeway, I would have used Pascal strings, it much less headache.

2. Files in `/bin` all include from the SDK. You can pretty much do the same for utility functions.

The includes, at least in nasm, are very much like copy-pasted code (or includes in C for that matter), and then you can just jump/call to the label.

I did not do it because I haven't been able to get nasm to optimize away the code that I don't use, and I didn't want to bloat the binaries or make a file for a 5LOC function.

All in all not good reasons in general, but it made sense to me in this context.

90s_dev · 72d ago

Thanks for answering my questions. Your project is really really interesting.

Two more questions if you find some spare time:

3. Why does it use tty for interrupts instead of directly calling int 10?

4. How does this even print to the screen or use a tty in the first place? Is it just something inherent in bios api?

shikaan · 71d ago

Hey, thanks for your interest in this project!

3. The tty interrupt advances the cursor along with printing. So, once again, I do it to save on some instructions. In the first iterations I wanted to retain more control (by printing and moving as separate operations) so that I could reuse this across the board, but eventually I ran out of space.

4. I am relying heavily on BIOS interrupts, which are criminally underdocumented. The most reliable source is Ralph Brown's documentation[1] which is very far from what I was expecting to be authoritative documentation. Turns out this collection is really good and basically _the_ source of truth for BIOS interrupts.

To answer your question, yes, this is basically calling the BIOS API.

[1]: https://wiki.osdev.org/Ralf_Brown's_Interrupt_List

rerdavies · 71d ago

THIS is the bible for BIOS APIs"

https://bitsavers.trailing-edge.com/pdf/ibm/pc/ps2/PS2_and_P...

Complete with reference assembler source code.

shikaan · 71d ago

Oh boy, this is amazing! Thanks for the reference

kachapopopow · 72d ago

The linker probably compacts all of the code blocks and generally futher optimizes the final binary size.

shikaan · 72d ago

I would have assumed the same, but I haven't managed. On the other hand, I did not tinker too much with all these toggles; it's such a little amount of shared code (which is also partially different in some cases) that didn't particularly make sense to me.

If you know how to make it happen and/or want to contribute, hit me up (:

mycatisblack · 72d ago

Very cool! I have to ask: what would the total size be if the package included the bios functions?

Also: what could be done if the size limit were 8kbyte like the mask-rom bios days?

Thanks for pointing me towards the bosh emulator.

shikaan · 72d ago

Hey, thanks for taking a look!

On the former, I have no idea how to estimate BIOS functions size. Maybe I could just peek into an image and get a sense for it...

On the latter, with a 16x increase in available space, I guess I would do a much more thorough work in putting guardrails in place.

The API currently comes with a couple of traps (e.g., file names can be duplicated, processes are cooperative, all file operations perform disk I/O...) and it essentially requires guest applications to know about BIOS services in order to function.

Another sticky point I wish I had the space to address better are calling conventions, which I had to get rid of almost immediately to save on instructions.

> Thanks for pointing me towards the bosh emulator.

You're welcome! Bochs is such a nice tool which I discovered only for this project as well. It was a no-brainer, since I got no way to debug 16-bit assembly from QEMU (unless you go off and fork it[1])

[1]: https://gist.github.com/Theldus/4e1efc07ec13fb84fa10c2f3d054...

userbinator · 72d ago

what would the total size be if the package included the bios functions?

Probably a few dozen to over a hundred KB, maybe even over a MB, depending on the era of machine and what it has installed; e.g. the GPU option ROM would be included if you use int 10h, int 13h might be hooked by a disk adapter, and if you use int 16h to read from a USB keyboard, that'll go through the BIOS' USB stack which normally includes some code in SMM too.

djaychela · 71d ago

I have a more general question - what is the minimum that is needed to qualify as an operating system? Is there something agreed on generally? Searching operating system minimum requirements leads to the wrong kind of info for me....

shikaan · 71d ago

Honestly, I made it up :)

I thought about what would be the minimum I have to build in order to run some userland software that does "something". That to me looked like: spawn guest applications, make them persist something.

With slightly more leeway, I would probably do memory management as the next thing (besides what I mentioned in another thread here)

musicale · 71d ago

BIOS is underrated. Basically the driver portion of a DOS- (or CP/M)-like operating system. As demonstrated, you don't need to add too much to it (program loader, simple file system, maybe a command shell and system utilities if you are ambitious) to get a functional mini-DOS.

yjftsjthsd-h · 72d ago

Well that's cool. Does the name stand for something?

shikaan · 72d ago

The -le suffix is used in south of Germany for the small version of something. So OSle stands for small OS.

I'm not a native speaker, so maybe somebody else can paint a better picture. I used it just because part of my extended family comes from there (:

EDIT: s/prefix/suffix/

lloeki · 72d ago

I live in Alsace, which is in France but has a German-like dialect (Alemannic)

https://en.m.wikipedia.org/wiki/Alsatian_dialect

-ele is used a lot to denote something small, cute, adorable; maybe think of it as kind of like ちび (chibi) or -ちゃん (-chan) in Japanese.

Mann (man) => Mannele https://cookingwithbrendagantt.net/mannele-st-nicholas-bread...

Katz (cat) => katzele (kitty)

The suffix can be liberally (ab)used with any - native or foreign - word or (sur)name to emphatic or comedic effect.

Here I kinda guessed the -le use was such but around here I would have said "OSele" (oh-ess-uh-luh)

ninalanyon · 70d ago

Similar in English, the ie suffix is used to create a diminutive. Sweet -> sweetie. You can make cute cuter by saying cutie.

evertedsphere · 72d ago

as seen also in Spätzle, Müsli, or, to pick something more relevant on HN, the words Brötli (or Zöpfli)

-li is a different version of the same ending

unwind · 72d ago

*suffix.

A prefix goes before something.

shikaan · 72d ago

Indeed. Thanks for the correction; I edited the original message

fuzzfactor · 72d ago

On projects like this, where the IMG is small enough, I would think it was ideal to include osle.img with the zip.

sim7c00 · 72d ago

cool stuff, like you still fit quite a bit in there too, 510 bytes can be tricky.

if you want an ahci controller to 'see' it, it will need partition table too, which will make it even less bytes (or maybe cleverly encoded)

shikaan · 72d ago

I went back and forth about the file system and disk stuff a fair bunch, to be honest. Most of it, as you say, was mostly due to wrestling the space constraints.

If one day I'll give in and take the shell out or go multi-stage, I will definitely look at that.

Maybe it's worth blogging about the journey; it's been a few weeks of merciless trade-offs to reach a usable API. It can make for a fun read (:

Thanks for taking a look!

sim7c00 · 71d ago

haha, well all the best! its a cool project. i am happy i can forgot about BIOS and went UEFI haha. remember so many tedious nights trying to get an mbr to load an elf file and init x64 mode in one go :'). uefi (edk2) is a blessing if you come from BIOS land (tho mybe less fun in a way!)

userbinator · 72d ago

What sectors contain is irrelevant to AHCI. As long as the BIOS contains the appropriate interface to a block device, it can be used.

sim7c00 · 71d ago

the BIOS will recognize block devices as being of certain type and present them to controllers.

if you do not put partition table, qemu AHCI controller will not recognize disk as bootable and u cant use SATA. with only the magic footer at the end of mbr, it will only work on IDE controller.

try it.

userbinator · 71d ago

the BIOS will recognize block devices as being of certain type and present them to controllers.

What exactly do you mean by that? Device discovery proceeds from the root (usually PCIe bus, after CPU-specific init) to the leaves, not the other way around.

qemu AHCI controller

That's its problem then. This isn't a problem on real hardware.

revskill · 72d ago

All professors should be doing this decades ago right ?

stonogo · 72d ago

Why? That is to say: it's a really cool project, and clearly a labor of love, but from an academic perspective it's a collection of x86-specific commands.

revskill · 72d ago

I must be honest. Professors are not doing their good job here.

Markdown for the AI Era (github.com)

Bitcoin Lottery ~$1T prize pool (happystoic.github.io)

Gofumpt: A Stricter Gofmt (github.com)

Kimi-cc: Use Kimi latest model(kimi-k2-0711-preview) to drive your Claude Code (github.com)

Shoreman: Foreman in Shell (github.com)

A modular, LLM-agnostic fullstack agent framework (github.com)

How Trump's crackdown on universities is affecting the world (nytimes.com)

Latent Reflection (2025) (rootkid.me)

BB(6) Is Hard (Antihydra) (sligocki.com)

Augustus Jansson's Queen City Ink Adverts (publicdomainreview.org)

Windsurf Gambit: OpenAI Acquisition Turns into Google 'Hackqusition' (spyglass.org)

Show HN: CNDY – AI-First IRL Dating (cndy.world)

Unidentified object might have just crashed into Saturn (bgr.com)

Infinite Torment Nexus (tormentnexus.live)

Haiku Activity and Contract Report, June 2025 (haiku-os.org)

Google Gemini flaw hijacks email summaries for phishing (bleepingcomputer.com)

Open-source STM32 autopilot for long-range fixed-wing UAVs (SmartNavX)

Datadog Acquisitions (datadoghq.com)

Does Showing Seconds in the System Tray Use More Power? (lttlabs.com)

The Reggae Museum Foundation: Where Reggae and Dancehall History Lives (thereggaemuseumfoundation.org)

Quality of scientific papers questioned as academics 'overwhelmed' (theguardian.com)

Ask HN: Is the future of Coding Guidelines oriented towards LLMs?

Spongy material and the sun's power remove salt from seawater (techxplore.com)

Show HN: ZenStreak - I built an iOS app to help me quit... (apps.apple.com)

I Accidentaly Wrote a Compile-Time Executable State Machine (2020) (philippegroarke.com)

Jeffreys's Amazing Statistics Program (jasp-stats.org)

'I felt pure, unconditional love': the people who marry their AI chatbots (theguardian.com)

An official Swedish AI service was misused to create a greeting for Adolf Hitler (iconofsweden.com)

The missing guide to Dataflow Analysis in MLIR (lowlevelbits.com)

Folly: IOBuf and Zero-Copy Networking (uvdn7.github.io)

Am I not a man and a brother? (en.wikipedia.org)

A new Martian climate model suggests a mostly cold, harsh ancient environment (arstechnica.com)

Photos: The Scale of China's Solar-Power Projects (theatlantic.com)

Show HN: Quikfolioz – Vibe-Coded Portfolios, Crafted Fast for Devs on the Go (github.com)

Algorithms for making interesting organic simulations (bleuje.com)

Show HN: A Raycast-compatible launcher for Linux (github.com)

Show HN: Ethically – Daily moral dilemmas answered by you and AI (pickethically.com)

Sunday: Track Your D (github.com)

Does ChatGPT Encourage Dangerous Delusions? (thenewstack.io)

How much attention do you need, really? Experiments in O(1) latent reasoning (notion.so)

The average Windows user doesn't care about TPM 2.0 (neowin.net)

SimSun 12pt: A Default with a Legacy (fayziev.com)

A Technical Look at Iran's Internet Shutdowns (zola.ink)

Prompt Engineering: From Zero to Hero (promptz2h.com)

A Rust implementation of the Haraka hash function (github.com)

An Ultra Opinionated Guide to Reinforcement Learning (twitter.com)

Passive-Agressive Office Simulator Slack Bot (github.com)

Centuries-old DNA reveals origins of Greenland's unique dogs (nature.com)

Show HN: CMS-like editing for Markdown with contenteditable and 100 lines of JS (mattismegevand.com)

India releases roadmap for quantum-safe cybersecurity (ddnews.gov.in)

Show HN: OSle – A 510 bytes OS in x86 assembly

Comments (32)