Show HN: Hacker News em dash user leaderboard pre-ChatGPT
120 tkgally 117 8/30/2025, 3:40:23 AM gally.net ↗
The use of the em dash (—) now raises suspicions that a text might have been AI-generated. Inspired by a suggestion from dang [1], I created a leaderboard of HN users according to how many of their posts before November 30, 2022—that is, before the release of ChatGPT—contained em dashes. Dang himself comes in number 2—by a very slim margin.
Credit to Claude Code for showing me how to search the HN database through Google BigQuery and for writing the HTML for the leaderboard.
(Like now)
It’s become a weird kind of witch hunting regarding blogs, too, and I have a 20+ year old site that renders all of its content using Markdown extensions that do the same (and that also convert dual hyphens to em dashes—something I’ve been typing for about as long).
Oh look, a more complete leaderboard — click.
Oh. I'm at position 51.
https://news.ycombinator.com/item?id=35118338#35118598
- *Less formatting*: I don't start every bullet point with bold text
- *Varying structure*: I don't start each list item with a one or two word summary, followed by a longer description of what I mean
- *Mobile differences*: I actually only use em dashes on my phone, since it's easy to type on Android, but I refrain from their use on desktop.
A Vibe is not a Function—
Yet—how it compiles so—
An unseen kind of Language—
That only Coders—know—
Edit: And here’s me using fancy curly quotes. Maybe that’s an AI signal as well?
Here are 3 facts about pizza
Querying for the users who use "—" most as a proportion of all their comments:
zmgsabst uses them the most [1]; westoncb [2] is an older account that uses them fourth-most.
[0] https://console.cloud.google.com/marketplace/product/y-combi...
[1] https://news.ycombinator.com/threads?id=zmgsabst
[2] https://news.ycombinator.com/threads?id=westoncb
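For anyone who wants to reproduce the "proportion of all their comments" ranking on a local sample rather than in BigQuery, here is a minimal Python sketch. It is not the query that was actually run: the `(author, text)` input shape, the function name and the `min_comments` cutoff are all assumptions made for illustration.

```python
from collections import Counter

EM_DASH = "\u2014"

def em_dash_share(comments, min_comments=1):
    """Rank authors by the share of their comments containing an em dash.

    `comments` is an iterable of (author, text) pairs (hypothetical shape).
    Authors with fewer than `min_comments` comments are left out.
    """
    total = Counter()
    with_dash = Counter()
    for author, text in comments:
        total[author] += 1
        if EM_DASH in text:
            with_dash[author] += 1
    shares = {
        author: with_dash[author] / n
        for author, n in total.items()
        if n >= min_comments
    }
    # Highest share first; ties broken by author name for stable output.
    return sorted(shares.items(), key=lambda kv: (-kv[1], kv[0]))
```

A cutoff such as `min_comments` matters in practice, since an account with a single comment that happens to contain an em dash would otherwise top the list at 100%.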
ChatGPT always uses them without spaces—like this.
Idk, working in the AI space, I've started to write very succinctly and straight to the point, maybe as a counterweight to the often overly flattering, verbose forms of prose that the LLMs employ. I pay close attention to every word and try to never write more than is necessary.
But what if need more words for complicated idea?
Short message easy if just 'orange man good' or 'orange man bad' but what if want to explain reason also? Dumb down? What if discussion too dumb already?
But I'm not on a Mac right now so I don't know how to even make a real one at the moment other than that LaTeX method.
I guess on Windows it's Alt+0,1,5,1 on a numpad. Or you copy+paste from Character Map.
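Why 0151: with the leading zero, the Alt code is looked up in the system ANSI code page, and in Windows-1252 byte 151 is the em dash. A quick check in Python, assuming a Windows-1252 (Western European) code page; the helper name is made up for this example, and other code pages may map the byte differently.

```python
import unicodedata

def alt_code_char(code, codepage="cp1252"):
    """Return the character a leading-zero Alt code maps to in `codepage`."""
    return bytes([code]).decode(codepage)

for code in (150, 151):
    ch = alt_code_char(code)
    print(f"Alt+0{code} -> U+{ord(ch):04X} {unicodedata.name(ch)}")
# Alt+0150 -> U+2013 EN DASH
# Alt+0151 -> U+2014 EM DASH
```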
I find it a bit sad that using proper typography is now frowned upon, but it seems that ship has sailed.
But British usage – instead – uses spaces, so an en-dash or an em-dash is acceptable.
≤ ≥ ≠ × — – “ ” ’ ° … ¹ ² ³ ™ • ♣ ♢ ♡ ♠
If you work in languages other than English but have a standard English keyboard layout, a compose key is handy for typing accents and non-English letters/ligatures too.
[0] The AltGrDead variant just means that the regular dead keys on the US Intl are flipped; e.g. ' is now no longer dead per default: I have to hit altgr+' to make it dead (i.e. an acute accent (´)).
[2] https://console.cloud.google.com/marketplace/product/y-combi...
https://www.gally.net/miscellaneous/hn-em-dash-user-leaderbo...
This second version was vibe-coded with Codex CLI. I also tried Gemini CLI, but it didn’t work very well. The SQL scripts I ran at BigQuery were by Claude.
I am not a programmer or web designer, so I will leave these pages as they are, warts and all. It was a fun project, though. I never would have attempted something like this pre-vibe-coding.
Otherwise it looks like the "race" is biased towards just the number of comments posted.
Probably some autocomplete-related software release.
https://daringfireball.net/2018/02/ios_messages_smart_punctu...
A lot of symbols can be accessed with Alt Gr compared to Windows
I have also configured Shift+Compose to send the code 'dead_greek' using ~/.Xmodmap:
Then I can type α, β, γ, Δ, Ε, Ζ easily, although I hardly ever need this nowadays.
- you can’t make a ?.. or !.. with it
- the spacing between the dots is awful in a lot of fonts
- it is hideous in monospace
- typing ellipsis properly is a very easy gesture (triple-tap the dot key), arguably easier than Alt Gr + . (depending on the keyboard)
But an ellipsis is separate from and doesn't merge with sentence-terminal punctuation, whether it's a period or something else. (When it replaces words at the end of a sentence, the terminal punctuation follows the ellipsis; when it comes at the beginning of a sentence that follows another, the ellipsis follows the punctuation.) The constructs you say can't be formed with it aren't needed.
Meanwhile there are a lot of languages and cultures. Somewhere all those characters were useful for something. My Atari had a very fun utility that gave you a compose key that could combine just about everything on the keyboard to access all those weird characters of the extended ASCII table. <compose>+ao would give you "a" with a ring on top (å), <compose>+ae gave the Danish welded-together character that I can't even type any more on Windows.
The idea came from some Unix thing, I believe.
Feature request 2: Em-dash regular-dash ratio.
What's a “regular dash”?
Hyphen-minus (which isn't even a dash at all)? En-dash? Figure dash?
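Since the answer depends on which character counts as "regular", a small Python sketch can simply tally each candidate by its Unicode name and take the ratio against whichever one you pick. The function names and the default choice of hyphen-minus as the denominator are assumptions for illustration, not a proposal for how the leaderboard should do it.

```python
import unicodedata
from collections import Counter

# Hyphen-minus, figure dash, en dash, em dash, minus sign.
DASHES = "\u002d\u2012\u2013\u2014\u2212"

def count_dashes(text):
    """Count each dash-like character in `text`, keyed by its Unicode name."""
    counts = Counter(ch for ch in text if ch in DASHES)
    return {unicodedata.name(ch): n for ch, n in counts.items()}

def em_dash_ratio(text, regular="\u002d"):
    """Em dashes per `regular` dash, or None if `regular` never occurs."""
    denominator = text.count(regular)
    if denominator == 0:
        return None
    return text.count("\u2014") / denominator
```

Passing `regular="\u2013"` would give the em-dash to en-dash ratio instead, which is arguably the more typographically meaningful comparison.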
Is the number of em dashes counted, or the number of comments that have at least one em dash in them?
You know, I am asking for...science(?).
I also wanted to point out that these could be Cantonese/Mandarin/Japanese/Southeast Asian users who use their local keymapping software, because a lot of them use the idiom symbols (e.g. the dot character, too) when they switch to the English keymaps.
Check out what laptops usually look like over there; a lot of manufacturers build that right into the firmware.
On the other hand, I don’t think o3 was ever a common choice among people copying from LLMs, so en dashes remain infrequent regardless.
J/k:)
You can't write CO₂ or m², use a fraction like ½, claim © or mention a price in Euros or Pounds Sterling.
You can't even write major American place names (San José, Oʻahu).
The em-dash used to be a slightly snooty way for Mac users to announce themselves. Sad that the polarity of perception has reversed.
I’ve been typing em-dashes since I got my first MacBook in 2006 and I’m not going to let the AI companies take my beautiful punctuation away from me.
Compose-minus-minus-minus in X
It's one of the long-press punctuation marks on Android
Option-shift-minus on Mac
I remap a key to the right of Space to Compose, and add various custom sequences. Before long, I was completely comfortably and casually typing dashes and curly quotes and more, and in fact it takes conscious effort for me to limit myself to ASCII when typing prose. (Writing code, writing *, /, -, ' and " is easy. But writing prose, I genuinely will write ×, ÷ if it feels the right one in that place, −, ‘/’ and “/”.)
On one previous laptop keyboard I mapped Menu, on my current one RAlt is more suitable.
When on Windows, I use WinCompose. On Linux, I used to just use it bare, which had advantages and disadvantages—apps implement a Compose key inconsistently, some messing things up related to includes and some handling overlapping sequences differently. More recently I wanted to be able to type Telugu and installed fcitx5 which is no longer mostly broken under Wayland like it was last time I tried, so now fcitx5 is handling the Compose sequences across the entire system, and working more consistently. Also I can use Ctrl+Alt+Shift+U and get a popup where I can search Unicode by code or description. Now if only that pesky popup would handle Shift+Space and Ctrl+Backspace itself rather than letting them fall through to the parent…
In my ~/.config/sway/config:
(caps:backspace isn’t entirely relevant here, but it’s on the same line and I choose to mention it. When people are remapping Caps Lock, I’ve never understood why so many seem to choose to make it Escape. Just extend the left hand and slap the corner of the keyboard with the ring finger, it’s not a huge movement and is easy to reach and return. Backspace, however, tends to be needed at least as often (and yes, I say that despite using Vim), and is much harder to hit. In my mind, a far better candidate for shifting to that prime real estate.)
For my ~/.XCompose, I start with the defaults and one good set of additions, https://raw.githubusercontent.com/kragen/xcompose/master/dot...:
Then I add all kinds of additions. Lots of fine typography stuff like zero-width space and non-joiner, narrow no-break space, thin space… a few more hyphen/dash mappings… and lots of other things like nice emoji sequences, music notation stuff, Greek letters matching Vim digraphs, superscript ordinals (ˢᵗ, ⁿᵈ, ʳᵈ, ᵗʰ), the keyboard shortcut symbols macOS uses (⌘⌃⌥⇧⌫ and another dozen less common ones), control pictures like ␆, and a handful of other things.
When all’s said and done:
• Compose - - - gets me — EM DASH (stock)
• Compose - - . gets me – EN DASH (stock)
• Compose - - = gets me − MINUS SIGN (custom)
• Compose - - w gets me ⸺ TWO EM DASH (custom; w for wide)
• Compose - - W gets me ⸻ THREE EM DASH (custom; W for Wider)
The last two I use occasionally, the other three I use very frequently. I went through a phase of using HYPHEN and SOFT HYPHEN, now I seldom use them.
I also like to write &c. (italic where supported) for et cetera.
For quotation marks, I also use custom mappings:
Think about how you physically type them, and I reckon these mappings make a lot of sense, very easy to type. Much better than the stock bindings (<' >' <" >") or kragen ones (`Space 'Space `` ''; or 6' 9' 6" 9").
—⁂—
(Oh yeah, that one’s <Multi_key> <h> <r> : "—⁂—".)
Now, I have one question I’d like answered. Overlapping sequences. If you have -> → and <- ← you’re fine, but when you add <-> ↔, I can’t find any way of using the <- sequence any more. Before fcitx5, some apps would ignore one or the other (in ways difficult to explain which I think involved the fact that some definitions came from includes), and some would let you terminate the sequence early and match the shorter one (e.g. Compose < - Enter). Is there some proper solution I’ve missed?
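To make the overlap concrete, here is a small Python sketch that finds every compose sequence that is a proper prefix of a longer one, which is exactly the case where the shorter sequence becomes ambiguous or unreachable. The `compose` dict is a hypothetical stand-in for parsed entries; the sketch does not read the real .XCompose format and says nothing about how fcitx5 or Xlib actually resolve the conflict.

```python
def shadowed_sequences(mappings):
    """Return {shorter: [longer, ...]} for sequences that prefix another.

    `mappings` maps a tuple of key names to the composed output string.
    """
    seqs = sorted(mappings)
    conflicts = {}
    for short in seqs:
        longer = [
            s for s in seqs
            if len(s) > len(short) and s[:len(short)] == short
        ]
        if longer:
            conflicts[short] = longer
    return conflicts

compose = {
    ("-", ">"): "\u2192",       # ->  gives a right arrow
    ("<", "-"): "\u2190",       # <-  gives a left arrow
    ("<", "-", ">"): "\u2194",  # <-> gives a left-right arrow
}
print(shadowed_sequences(compose))
# {('<', '-'): [('<', '-', '>')]}
```

Running a check like this over a full set of mappings at least shows which sequences would need an explicit terminator or a different spelling.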
I have plans for an article on my keyboard arrangements, including sharing a full .XCompose, but I’m going to finish my next major revision to my website first. Because then I’ll be able to draw things instead of just writing.
—⁂—
On mobile, I think I use FUTO keyboard at present, which lets me access most of these things, but not elegantly. I want to make my own keyboard layout that lets me access the good stuff more easily, but I haven’t got to it yet.
Also: anyone want to join me in advocating for completion dictionaries and libraries to replace their ' apostrophes with ’, or at least to support both approaches equally? I’m fed up with not having this stuff, Vim is the only place where it was straightforward to get it about right, and mobile is just a mess.
Whether this is interesting or not, well…
[1] https://news.ycombinator.com/item?id=45046883
What's needed is a writing comparison before/after 2022 for these users. If there's a sudden 200% increase in the use of em-dashes from one month to the next, it's a very strong indicator that the user started LLMing their posts.
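A rough Python sketch of that comparison, under stated assumptions: the input is one user's comments as `(iso_date, text)` pairs, the rate is the share of comments in a month that contain an em dash, and a "200% increase" means the rate at least triples between consecutive months that have comments. All names here are hypothetical, and small monthly samples will make the rate noisy.

```python
from collections import defaultdict

EM_DASH = "\u2014"

def monthly_em_dash_rate(comments):
    """Map 'YYYY-MM' -> share of that month's comments containing an em dash.

    `comments` is an iterable of (iso_date, text) pairs for a single user.
    """
    total = defaultdict(int)
    hits = defaultdict(int)
    for iso_date, text in comments:
        month = iso_date[:7]
        total[month] += 1
        hits[month] += int(EM_DASH in text)
    return {m: hits[m] / total[m] for m in sorted(total)}

def sudden_jumps(rates, min_increase=2.0):
    """List (previous_month, month, relative_increase) for large jumps.

    A relative increase of 2.0 corresponds to +200%. Months are compared
    in the order given, so gaps with no comments are skipped over.
    """
    months = list(rates)
    jumps = []
    for prev, cur in zip(months, months[1:]):
        if rates[prev] == 0:
            continue  # relative increase is undefined from a zero baseline
        increase = (rates[cur] - rates[prev]) / rates[prev]
        if increase >= min_increase:
            jumps.append((prev, cur, increase))
    return jumps
```

Skipping zero baselines means a user who goes from no em dashes at all to heavy use is not flagged by this version; handling that case would need an absolute threshold as well.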