Many ransomware strains will abort if they detect a Russian keyboard installed (2021) (krebsonsecurity.com)

Yes but also no. The thing about software is that 90% of it is not culturally bound. If you're writing, say, some tax reporting tool, a grammar reference, or something religious… sure, it makes sense to write that in your language. So, yeah, C should support that.

However, everything else, from spreadsheet software to CAD tools to OS kernels to JavaScript frameworks is universal across cultures and languages. And for better or for worse (I'm not a native English speaker either), the world has gone with English for a lot of code commons.

And the thing with the examples in that post isn't about supporting language diversity, it's math symbols which are noone's native language. And you pretty much can't type them on any keyboard. Which really makes it a rather poor flex IMHO. Did the author reconfigure their keyboard layout for that specific math use case? It can't generically cover "all of math" either. Or did they copy&paste it around? That's just silly.

ajross · 9m ago

Little to no source code is written for single (human) language development teams. Sure, everyone would like the ability to write source code in their native language. That's natural.

Literally no one, anywhere, wants to be forced to read source written in a language they can't read (or more specifically in this case: written in glyphs they can't even produce on their keyboard). That idea, for almost everyone, seems "horrific", yeah.

So a lingua franca is a firm requirement for modern software development outside of extremely specific environments (FSB malware authors probably don't care about anyone else reading their cyrillic variable names, etc...). Must it be ASCII-encoded English? No. But that's what the market has picked and most people seem happy enough with it.

tialaramex · 4h ago

Presumably this was converted from markdown or similar and the conversion partly failed or the input was broken.

From the PVI section onward it seems to recover, but if the author sees this please fix and re-convert your post.

[Edited, nope, there are more errors further in the text, this needed proper proofreading before it was posted, I can somewhat struggle through because I already know this topic but if this was intended to introduce newcomers it's probably very confusing]

zombot · 4h ago

Does C allow Unicode identifiers now, or is that pseudo code? The code snippets also contain `&`, so something definitely went wrong with the transcoding to HTML.

pjmlp · 3h ago

Besides the sibling comment on C23, it does work fine on GCC.

https://godbolt.org/z/qKejzc1Kb

Whereas clang loudly complains,

https://godbolt.org/z/qWrccWzYW

Y_Y · 39m ago

Implementation-defined until C99, explicitly possible via UCNs aince c99, possible with explicit encoding since C23, but literals are still implementation defined.

qsort · 4h ago

Quoting cppreference:

An identifier is an arbitrarily long sequence of digits, underscores, lowercase and uppercase Latin letters, and Unicode characters specified using \u and \U escape notation(since C99), of class XID_Continue(since C23). A valid identifier must begin with a non-digit character (Latin letter, underscore, or Unicode non-digit character(since C99)(until C23), or Unicode character of class XID_Start)(since C23)). Identifiers are case-sensitive (lowercase and uppercase letters are distinct). Every identifier must conform to Normalization Form C.(since C23)

In practice depends on the compiler.

dgrunwald · 4h ago

But the source character set remains implementation-defined, so compilers do not have to directly support unicode names, only the escape notation.

Definitely a questionable choice to throw off readers with unicode weirdness in the very first code example.

qsort · 4h ago

If it were up to me, anything outside the basic character set in a source file would be a syntax error, I'm simply reporting what the spec says.

ncruces · 3h ago

I use unicode for math in comments, and think makes certain complicated formulas far more readable.

kzrdude · 1h ago

I've just been learning pinyin notation, so now i think the variable řₚ should have a value that first goes down a bit and then up.

zelphirkalt · 1h ago

I am not sure it is a good idea to mix such specific phonetic script ideas about diacritic marks with the behavior of the program over time. Even considering the shape, it does not align with the idea of first down a little, then up a lot.

guipsp · 2h ago

What a "basic character set" is depends on locale

qsort · 2h ago

https://en.cppreference.com/w/c/language/charset.html

account42 · 1h ago

Anything except US-ASCII in source code outside comments and string constants should be a syntax error.

guipsp · 53m ago

You are aware other languages exist? Some of which don't even use the Latin script?

Y_Y · 38m ago

What; like APL‽

unwind · 4h ago

I can't even view the post, I just get some kind of content management system-like with the page as JSON or something, in pink-on-white. I'm super confused. :|

The answer to your question seems to (still) be "no".

eqvinox · 42m ago

Using the "register" storage class feels really alien for C code written in 2025…

jvanderbot · 3h ago

I love Rust, but I miss C. If C can be updated to make it generally socially acceptable for new projects, I'd happily go back for some decent subset of things I do. However, there's a lot of anxiety and even angst around using C in production code.

flohofwoe · 3h ago

> to make it generally socially acceptable for new projects...

Or better yet, don't let 'social pressure' influence your choice of programming language ;)

If your workplace has a clear rule to not use memory-unsafe languages for production code that's a different matter of course. But nothing can stop you from writing C code as a hobby - C99 and later is a very enjoyable and fun language.

Y_Y · 36m ago

I don't want to summon WB, but honest-to-god, D is a good middle ground here.

TimorousBestie · 1h ago

> Or better yet, don't let 'social pressure' influence your choice of programming language ;)

It’s hard. Programming is a social discipline, and the more people who work in a language, the more love it gets.

spauldo · 16m ago

If you're on UNIX or working in the embedded space, C is still everywhere and gets lots of love. C tends to get lots of libraries anyway because everything can FFI to it.

xxs · 2h ago

I was about the reply no amount of pressure can tell me how to program. C was totally fine for esp32

bnferguson · 2h ago

Feels like Zig is starting to fill that role in some ways. Fewer sharp edges and a bit more safety than C, more modern approach, and even interops really well with C (even being possible to mix the two). Know a couple Rust devs that have said it seems to scratch that C itch while being more modern.

Of course it's still really nice to just have C itself being updated into something that's nicer to work with and easier to write safely, but Zig seems to be a decent other option.

dnautics · 1h ago

(self-promotion) in principle one should be able to implement a fairly mature pointer provenance checker for zig, without changing the language. A basic proof of concept (don't use this, branches and loops have not been implemented yet):

https://www.youtube.com/watch?v=ZY_Z-aGbYm8

pjmlp · 2h ago

As usual the remark that much of the Zig's safety over C, has been present since the late 1970's in languages like Modula-2, Object Pascal and Ada, but sadly they didn't born with curly brackets, nor brought a free OS to the uni party.

mikewarot · 3h ago

If you can stomach the occasional Begin and End, and a far less confusing pointer syntax, Pascal might be the language for you. Free Pascal has some great string handling, so you never have to worry about allocating and freeing them, and they can store gigabytes of text, even Unicode. ;-)

jvanderbot · 3h ago

If my fellow devs cringe at C, imagine their reaction to Pascal

mikewarot · 2h ago

C has all the things to hate in a programming language

  CaSe Sensitivity
  Weird pointer syntax
  Lack of a separate assignment token
  Null terminated strings
  Macros - the evil scourge of the universe

On the plus side, it's installed everywhere, and it's not indent sensitive

ioasuncvinvaer · 1h ago

Except for null terminated strings these don't seem like mayor issues to me. Can you elaborate?

zelphirkalt · 1h ago

You mean "mere string replacement macros, instead of hygienic macros", of course : )

jvanderbot · 2h ago

At this point, you're talking to someone who isn't here

1718627440 · 1h ago

> Lack of a separate assignment token

What does that mean?

kbolino · 1h ago

Assignment is = which is too close to equality == and thus has been the source of bugs in the past, especially since C treats assignment as an expression and coerces lots of non-boolean values to true/false wherever a condition is expected (if, while, for). Most compilers warn about this at least nowadays.

tgv · 3h ago

Or try Ada.

modeless · 2h ago

Fil-C is a modified version of Clang that makes C and C++ memory safe. It supports things you wouldn't expect to work like signal handling or setjmp/longjmp. It can compile real C projects like SQLite and OpenSSL with minimal to no changes, today. https://github.com/pizlonator/llvm-project-deluge/blob/delug...

b0a04gl · 1h ago

provenance model basically turns memory back into a typed value. finally malloc wont just be a dumb number generator, it'll act more like a capability issuer. and access is not 'is this address in range' anymore, but “does this pointer have valid provenance”. way more deterministic, decouples gcc -wall

HexDecOctBin · 1h ago

Will this create more nasal demons? I always disable strict aliasing, and it's not clear to me after reading the whole article whether provenance is about making sane code illegal, or making previously illegal sane code legal.

jcranmer · 36m ago

All C compilers have some notion of pointer provenance embedded in them, and this is true going back decades.

The problem is that the documented definitions of pointer provenance (which generally amount to "you must somehow have a data dependency from the original object definition (e.g., malloc)") aren't really upheld by the optimizer, and the effective definition of the optimizer is generally internally inconsistent because people don't think about side effects of pointer-to-integer conversion. The one-past-the-end pointer being equal (but of different provenance) to a different object is a particular vexatious case.

The definition given in TS6010 is generally the closest you'll get to a formal description of the behavior that optimizers are already generally following, except for cases that are clearly agreed to be bugs. The biggest problem is that it makes pointer-to-int an operation with side effects that need to be preserved, and compilers today generally fail to preserve those side effects (especially when pointer-to-int conversion happens more as an implicit operation).

The practical effect of provenance--that you can't magic a pointer to an object out of thin air--has always been true. This is largely trying to clarify what it means to actually magic a pointer out of thin air; it's not a perfect answer, but it's the best answer anyone's come up with to date.

layer8 · 23m ago

This is basically a formalization of the general understanding one already had when reading the C standard thoroughly 25 years ago. At least I was nodding along throughout the article. It cleans up the parts where the standard was too imprecise and handwavy.

cenobyte · 45m ago

Please fix the code in your post.

briandw · 3h ago

The code blocks are very difficult to read on this page. I had ChatGPT O3 rewrite this in a more accessible format. https://chatgpt.com/share/68629096-0624-8005-846f-7c0d655061...

cenobyte · 49m ago

So much better. Thank you!

The Academic Pipeline Stall: Why Industry Must Stand for Academia – ACM Sigops (sigops.org)

Show HN: TokenDagger – A tokenizer faster than OpenAI's Tiktoken (github.com)

Donkey Kong Country 2 and Open Bus (jsgroth.dev)

The provenance memory model for C (gustedt.wordpress.com)

Show HN: New Ensō – first public beta (untested.sonnet.io)

Gridfinity: The modular, open-source grid storage system (gridfinity.xyz)

The Plot of the Phantom, a text adventure that took 40 years to finish (scottandrew.com)

Printegrated Circuits: Merging 3D Printing and Electronics (spectrum.ieee.org)

Show HN: Open-Source International Space Station Tracker ESP32/Arduino for $20 (github.com)

Reverse Engineering Vercel's BotID (nullpt.rs)

New proof dramatically compresses space needed for computation (scientificamerican.com)

Ask HN: What Are You Working On? (June 2025)

I made my VM think it has a CPU fan (wbenny.github.io)

Ubuntu: Introducing Debcrafters (discourse.ubuntu.com)

Cross-Compiling Common Lisp for Windows (fosskers.ca)

Data Centers, Temperature, and Power (backblaze.com)

A glob of 99M-year-old amber trapped a zombie fungus erupting from a fly (cnn.com)

Shadow of a Doubt (harpers.org)

Revisiting Knuth's “Premature Optimization” Paper (probablydance.com)

Cell Towers Can Double as Cheap Radar Systems for Ports and Harbors (2014) (spectrum.ieee.org)

Ultrasound toothbrush promises painless checks for hidden gum problems (phys.org)

Show HN: I built a daily sunlight tracker (lumehealth.io)

Jane Austen's Boldest Novel Is Also Her Least Understood (nytimes.com)

The role of the University is to resist AI (danmcquillan.org)

Anticheat Update Tracking (not-matthias.github.io)

Event – Fast, In-Process Event Dispatcher (github.com)

Want to meet people, try charging them for it? (notes.eatonphil.com)

NativeJIT: A C++ expression –> x64 JIT (2018) (github.com)

Does Form Shape Function? (quantamagazine.org)

The Medley Interlisp Project: Reviving a Historical Software System [pdf] (interlisp.org)

Many ransomware strains will abort if they detect a Russian keyboard installed (2021) (krebsonsecurity.com)

The $25k car is going extinct? (media.hubspot.com)

Building untrusted container images safely at scale (depot.dev)

How urea forms spontaneously (ethz.ch)

The Book of Shaders (2015) (thebookofshaders.com)

Nearly 20% of cancer drugs defective in four African nations (dw.com)

Use keyword-only arguments in Python dataclasses (chipx86.blog)

LetsEncrypt – Expiration Notification Service Has Ended (letsencrypt.org)

Finding a former Australian prime minister’s passport number on Instagram (2020) (mango.pdf.zone)

Modelling API rate limits as diophantine inequalities (vivekn.dev)

Show HN: Octelium – FOSS Alternative to Teleport, Cloudflare, Tailscale, Ngrok (github.com)

Touching the back wall of the Apple store (blog.lauramichet.com)

We accidentally solved robotics by watching 1M hours of YouTube (ksagar.bearblog.dev)

Reverse Engineering the Microchip CLB (mcp-clb.markomo.me)

Sequence and first differences together list all positive numbers exactly once (oeis.org)

Bloom Filters by Example (llimllib.github.io)

Arm muscles into server market – but can't wrestle control from x86 just yet (theregister.com)

To the Postbox (literaryreview.co.uk)

Implementing fast TCP fingerprinting with eBPF (halb.it)

Using the Internet without IPv4 connectivity (jamesmcm.github.io)

The provenance memory model for C

Comments (54)