Cool. One thing I found odd was that on export there are two listed formats. "ASCII" and "ASCII extended" but as far as I can tell, the ASCII version is actually outputting UTF-8. It's hard to tell for sure though because the output is just text that you can cut and paste and so it's difficult to know what conversions the browser or OS might be doing behind the scenes. But when I paste it into a text editor on my mac, it's definitely UTF-8, not ASCII encoded.
Which is probably more useful anyway given that if it really outputted ASCII encoded line drawing characters, you'd end up with gibberish on a system that assumed UTF-8 encoding.
numpad0 · 21m ago
20 20 78 78 78 ... Looks ASCII to me, Firefox on Windows. Could be OS.
ilovetux · 40m ago
Disclaimer, I just pulled this quote from Google ai which probably took it from somewhere else, but I just wanted to provide a little context. ASCII encoded text is also valid utf8.
> The first 128 characters of Unicode, which are the same as the ASCII character set (characters 0-127), are encoded in UTF-8 using a single byte with the exact same binary value as their ASCII representation. This means that any file containing only ASCII characters is also a valid UTF-8 file
staplung · 14m ago
Yes, but the box drawing characters in ASCII are all above 127 so they don't encode the same way. So that last AI generated sentence is false: ASCII files that consist only of characters in the lower 127 will also be valid UTF-8. But ASCII files that use characters above 127 will not be valid UTF-8.
Basically, if the high bit is set in UTF-8 it indicates that more than one byte is needed to represent the code point.
em3rgent0rdr · 28m ago
Indeed, UTF-8 "was designed for backward compatibility with ASCII: the first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single byte with the same binary value as ASCII, so that a UTF-8-encoded file using only those characters is identical to an ASCII file."
Cool concept, but I couldn't get everything to work; namely the "Select & Move" on individual boxes or lines. The select area and move did work, though.
But now on web!
Which is probably more useful anyway given that if it really outputted ASCII encoded line drawing characters, you'd end up with gibberish on a system that assumed UTF-8 encoding.
> The first 128 characters of Unicode, which are the same as the ASCII character set (characters 0-127), are encoded in UTF-8 using a single byte with the exact same binary value as their ASCII representation. This means that any file containing only ASCII characters is also a valid UTF-8 file
Basically, if the high bit is set in UTF-8 it indicates that more than one byte is needed to represent the code point.
https://en.wikipedia.org/wiki/UTF-8
2022: https://news.ycombinator.com/item?id=30273299
2021: https://news.ycombinator.com/item?id=27536253
2018: https://news.ycombinator.com/item?id=16051428
2014: https://news.ycombinator.com/item?id=7085133