Part 1: A Deep Dive into Rust and C Memory Interoperability

76 hyperbrainer 36 8/4/2025, 3:12:20 PM notashes.me ↗

Comments (36)

veber-alex · 1h ago

The reason you are not seeing crashes when allocating with Rust and freeing with C (or vice versa) is that by default Rust also uses the libc allocator.

https://stdrs.dev/nightly/x86_64-unknown-linux-gnu/src/std/s...

ryanf · 1h ago

This article looked interesting, but I bounced off it because the author appears to have made heavy use of an LLM to generate the text. How can I trust that the content is worth reading if a person didn't care enough to write it themselves?

zem · 55m ago

it sounds nothing like AI to me! or AI has advanced to the point where it is hard to tell - e.g. I wouldn't expect a sentence like "You’re not just getting 64 bytes of memory. You’re entering into a complex contract with a specific allocator implementation." from one.

pests · 46m ago

While I usually hate all the accusations of writings being LLM generated, I find your example a bit odd as that phasing is very typical of ChatGPT, especially when it was glazing everyone after that one update they had to reverent.

“It’s not just _________. It’s _________________.”

This was in almost every response doubling down on the users ideas and blowing things out of proportion. Stuff like…

“It’s not just a good idea. It’s a ground up rewriting of modern day physics.”

rocky_raccoon · 38m ago

I picked up on it very quickly as well. Here are some more phrases that match that same LLM pattern. Sure, you could argue that someone actually writes like this, but after a while, it becomes excessive.

- Your program continues running with a corrupted heap - a time bomb that will explode unpredictably later.

- You’re not just getting 64 bytes of memory. You’re entering into a complex contract with a specific allocator implementation.

- The Metadata Mismatch

- If it finds glibc’s metadata instead, the best case is an immediate crash. The worst case? Silent corruption that manifests as mysterious bugs hours later.

- Virtual Memory: The Grand Illusion

- CPU Cache Architecture: The Hidden Performance Layer

- Spoiler: it’s even messier than you might think.

zem · 36m ago

huh, interesting, I guess I haven't read enough of it to pick up on the patterns

rectang · 1h ago

I find it hard to believe that an LLM would have come up with this quote to start the article:

> “Memory oppresses me.” - Severian, The Book of the New Sun

That sort of artistic/humourous flourish isn't in character for an LLM.

TechDebtDevin · 1h ago

Do you see Emojis in tables/code now and assume the person is using an llm? I dont really see it.

shiftingleft · 57m ago

The author admits to it.

https://www.reddit.com/r/rust/comments/1mh7q73/comment/n6uan...

The reply to that comment is also a good explainer of why the post has such a strong LLM smell for many.

ryanf · 47m ago

Yeah, I completely agree with that reply, thanks for the link.

BTW that Reddit post also has replies confirming my suspicions that the technical content wasn't trustworthy, if anyone felt like I was just being snobby about the LLM writing: https://www.reddit.com/r/rust/comments/1mh7q73/comment/n6ubr...

ryanf · 1h ago

Maybe I'm too paranoid! If it's not LLM then I don't think it's a very well-organized post though.

In addition to the emoji, things that jumped out at me were the pervasive use of bullet lists with bold labels and some specific text choices like

> Note: The bash scripts in tools/ dynamically generate Rust code for specialized analysis. This keeps the main codebase clean while allowing complex experiments.

But I did just edit my post to walk it back slightly.

skydhash · 54m ago

Not TFA’s author

As a non-native English speaker, 90% of my vocabulary come from technical books and SF and Fantasy novels. And due to an education done in French, I tend to prefer slightly complicated sentences forms.

If someone uses LLM to give their posts clarity or for spellchecking, I would aplaud them. What I don’t agree with, LLM use or no, is meandering and inconsistency.

OmarAssadi · 1h ago

Personally, it is one of the flags, yeah. It's been a while since I've tried ChatGPT or some of the others, but the structure and particular usage felt a lot like what I'd have gotten out of deepseek.

It's not a binary thing, of course, but it's definitely an LLM smell, IMO.

mvieira38 · 1h ago

I mean, are we supposed not to? This doesn't read like a blog at all, it even has the dreaded "Key Takeaways" end section... The content is good and seems genuinely researched, but the text looks "AI enhanced", that's all

phkahler · 2h ago

Something I'd like to know for mixing Rust and C. I know it's possible to access a struct from both C and Rust code and have seen examples. But those all use accessor functions on the Rust side rather than accessing the members directly. Is it possible to define a structure in one of the languages and then via some wrapper or definitions be able to access it idiomatically in the other language? Can you point to some blog or documentation explaining how?

oconnor663 · 1h ago

Here's one of my recorded talks going through an example of using a `#[repr(C)]` struct (in this case one that's auto-generated by Bindgen): https://youtu.be/LLAUzghhNHg?t=2168

GrantMoyer · 1h ago

Rust bindgen[1] will automatically generate native Rust stucts (and unions) from C headers where possible. Note that c_int, c_char, etc. are just aliases for the corresponding native Rust types.

However, not all C constructs have idomatic Rust equivalents. For example, bitfields don't exist in Rust, and unlike Rust enums, C enums can have any value of the underlying type. And for ABI reasons, it's very commom in C APIs to use a pointer to an opaque type paired with what are effectively accessor function and methods, so mapping them to accessors and methods on a "Handle" type in Rust often is the most idomatic Rust representation of the C interface.

[1]: https://github.com/rust-lang/rust-bindgen

pmalynin · 2h ago

Like, repr(C)?

https://doc.rust-lang.org/nomicon/other-reprs.html

Arnavion · 2h ago

I don't know what examples you've been seeing. The interop structs are just regular Rust structs with the `#[repr(C)]` attribute applied to them, to ensure that the Rust compiler lays the struct out exactly as the C compiler for that target ABI would. Rust code can access their fields just fine. There's no strict need for accessor functions.

stouset · 1h ago

And vice versa. Rust code and C code can both operate on each other’s structs natively.

`#[repr(C)]` instructs the compiler to lay the struct out exactly according to C’s rules: order, alignment, padding, size, etc. Without this, the compiler is allowed a lot more freedom when laying out a struct.

eatonphil · 2h ago

One of the areas I wonder about this a lot is when integrating Rust code into Postgres which has its own allocator system. Mostly right now when we need to have complex data structures (non-Postgres data structures) that must live outside of the lexical scope we put them somewhere global and return a handle to the C code to reference the object. But with the upcoming support for passing an allocator to any data structure (in the Rust standard library anyway) I think this gets a lot easier?

Arnavion · 2h ago

>But with the upcoming support for passing an allocator to any data structure (in the Rust standard library anyway) I think this gets a lot easier?

Yes and no. Even within libstd, some things require A=GlobalAlloc, eg `std::io::Read::read_to_end(&mut Vec<u8>)` will only accept Vec<u8, GlobalAlloc>. It cannot be changed to work with Vec<u8, A> because that change would make it not dyn-compatible (nee "object-safe").

And as you said it will cut you off from much of the third-party crates ecosystem that also assumes A=GlobalAlloc.

But if the subset of libstd you need supports A=!GlobalAlloc then yes it's helpful.

tialaramex · 1h ago

For me the most interesting thing in Allocator is that it's allowed to say OK, you wanted 185 bytes but I only have a 256 byte allocation here, so, here is 256 bytes.

This means that e.g. a growable container type doesn't have to guess that your allocator probably loves powers of 2 and so it should try growing to 256 bytes not 185 bytes, it can ask for 185 bytes, get 256 and then pass that on to the user. Significant performance is left on the table when everybody is guessing and can't pass on what they know due to ABI limitations.

Rust containers such as Vec are already prepared to do this - for example Vec::reserve_exact does not promise you're getting exactly the capacity you asked for, it won't do the exponential growth trick because (unlike Vec::reserve) you've promised you don't want that, but it would be able to take advantage of a larger capacity provided by the allocator.

steveklabnik · 2h ago

I’m not sure what those two things have to do with each other, though I did just wake up. The only thing the new allocator stuff would give you is the ability to allocate a standard library data structure with the Postgres allocator. Scoping and handles and such wouldn’t change, and using your own data structures wouldn’t change.

It’s also very possible I’m missing something!

eatonphil · 2h ago

> The only thing the new allocator stuff would give you is the ability to allocate a standard library data structure with the Postgres allocator.

Yeah no this is basically all I'm saying. I'm excited for this.

steveklabnik · 1h ago

Ah yeah, well it's gonna be a good feature for sure when it ships!

Tony_Delco · 1h ago

Fantastic opening line (“Memory oppresses me.”). If this article was written by an AI, it’s the best AI I’ve seen in months.

Seriously though: I already knew the “don’t mix allocators” rule, but I really enjoyed seeing such a careful and hands-on exploration of why it’s dangerous. Thanks for sharing it.

tracker1 · 1h ago

Interesting read... and definitely good to know base of knowledge especially if you're working in transitional or mixed codebases.

sesm · 2h ago

Section named "The Interview Question That Started Everything" doesn't contain the interview question.

hyperbrainer · 2h ago

That's the first thing on the page.

> Interviewer: “What happens if you allocate memory with C’s malloc and try to free it with Rust’s dealloc, if you get a pointer to the memory from C?”

> Me: “If we do it via FFI then there’s a possibility the program may continue working (because the underlying structs share the same memory layout? right? …right?)”

sesm · 1h ago

That's fair. Personally, I've skipped that entire pre-section thinking it's a long quote from some book.

7e · 2h ago

Allocating memory with C and freeing it with Rust is silly. If you want to free a C-allocated pointer in Rust, just have Rust call back in to C. Expecting that allocators work identically in both runtimes is unreasonable and borderline insane. Heck, I wouldn't expect allocators to work the same even across releases of libc from the same vendor (or across releases of Rust's std).

rectang · 1h ago

I don't agree with your contemptuous framing. It's incorrect, and per the post's author, "dangerous" — but depending on your background it's not "silly" or "borderline insane". It's just naive, and writing a slab allocator as an exercise or making honest explorations like in this blog post will help cure the naivete.

benmmurphy · 1h ago

usually when interfacing with a library written in c the library will export functions for object destruction. it makes sense for that to be part of the interface instead of using the system allocator because it also gives the library freedom to do extra work during object destruction. if you have simple objects then its possible to just use the system allocator, but if you have graphs or trees of objects then its necessary to have a custom destroy function and there is always some risk in the future you might be forced to need to allocate more complex data structures that require multiple allocations.

Arnavion · 1h ago

The article is about how and why mixing allocators fails, not if it fails or how to fix the problem.

jokoon · 1h ago

Any insight on the quantity of paid rust job out there?

Wildfire Evacuation from Berkeley Hills Could Take over 4 Hours, Study Finds (theconversation.com)

Tiny fossil suggests spiders and their relatives originated in the sea (news.arizona.edu)

How childhood gaming formed my work preferences (hamatti.org)

Show HN: A simple word search game (wordgame.o565.com)

Show HN: Miniwhips – Transform your car photo into its toy version (miniwhips.app)

Ask HN: What "trick of the trade" took you too long to learn?

Big O vs. Hardware: Better Complexity ≠ Better Performance (blog.codingconfessions.com)

Show HN: ReplyFast – AI replies to your emails instantly, in your own style

Show HN: Automate social media with postiz and n8n (github.com)

Passive Smartphone Sensors for Detecting Psychopathology (jamanetwork.com)

The impact of climate change on state and local governments' fiscal health (brookings.edu)

SQLite offline sync for Android quick start (github.com)

Zinus Saves $140k and Cuts Development Time by 50% with Replit (blog.replit.com)

Show HN: SuppScanr – Search supplements, build stacks, check interactions (suppscanr.com)

Apple's history is hiding in a Mac font (spacebar.news)

The Internet's Own Boy: The Story of Aaron Swartz (archive.org)

D-Wave Introduces New Developer Tools to Advance Quantum AI Developement (dwavequantum.com)

Building Native Apps with PHP: The Story of Simon Hamp and Native PHP (open.spotify.com)

IT firing spree: Shrinking job market looks worse after BLS revisions (theregister.com)

Show HN: I Made a plugin-based workspace app to avoids bloat (hollow-space.vercel.app)

Volonaut: Personal Hoverbike (volonaut.com)

A Gentle Introduction to Fortran (hackaday.com)

Show HN: A rental marketplace for migrant founders moving to SF for YC F25 (app.splitin.net)

I Found 12 People Who Ditched Their Expensive Software for AI-Built Tools (every.to)

Founder's responsibility to protect the value of your team's equity

SEP-XXXX: Server-Side Authorization Management with Client Session Binding (github.com)

Exploring NotebookLM Alternatives (kdnuggets.com)

Using Git Worktrees for Development (blog.kulman.sk)

Diagrammatic algebra: On the road to category theory (chalkdustmagazine.com)

True Unidirectional WiFi broadcasting of video data for FPV Drones (befinitiv.wordpress.com)

Red-teaming a RAG app: What happens? (blog.pamelafox.org)

A modest proposal for new holidays to manage your digital life (daverupert.com)

Show HN: Host local-only MCP tools in the cloud with Streamable HTTP

Ask HN: How to build a 2D wave-like line graph that responds to keyboard events?

Tandy Corporation, Part 4 – By Bradford Morgan White (abortretry.fail)

I Asked Four Former Friends Why We Stopped Speaking-Here's What I Learned (2023) (vogue.com)

Qwen-Image – a 20B MMDiT model for next-gen text-to-image generation (twitter.com)

Show HN: Modos Developer Kit Live on Crowd Supply (crowdsupply.com)

An Open Letter to OpenAI (openai-transparency.org)

Castro Podcasts – iPad and Device Sync (castro.fm)

Evaluation Algorithms for Parametric Curves and Surfaces (mdpi.com)

Squashing my dumb bugs and why I log build IDs (rachelbythebay.com)

LLMs Aren't Just for Sissies (mattsayar.com)

Staan : European Search Index and API (staan.ai)

Robin Berjon: Web Standards (protocol.ecologies.info)

JavaOne 2026 Dates Announced (inside.java)

A proof is that which is convincing (substack.com)

Updated Portal Map Editor in Battlefield 6 Runs on Godot Engine (80.lv)

AI Embiggens the Big Clouds, Especially Microsoft (nextplatform.com)

Firefox Has a New Home (windowsreport.com)

Part 1: A Deep Dive into Rust and C Memory Interoperability

Comments (36)