
296 points todsacerdoti | 1 comments
rryan ◴[] No.44373939[source]
Don't make me tap the sign: There is no such thing as "bytes". There are only encodings. UTF-8 is the encoding most people are using when they talk about modeling "raw bytes" of text. UTF-8 is just a shitty (biased) human-designed tokenizer of the Unicode code points.
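To make the point about encodings concrete, here is a minimal Python sketch (not from the original comment; the sample string and the `encode_all` helper are illustrative assumptions). It shows that the same sequence of code points turns into different "raw bytes" depending on which encoding is chosen, so a byte-level model is really modeling one particular encoding.

```python
# Illustrative sketch: the same code points yield different byte sequences
# under different encodings. `encode_all` is a hypothetical helper name.

def encode_all(text):
    """Return the bytes for `text` under three standard Unicode encodings."""
    names = ("utf-8", "utf-16-le", "utf-32-le")
    return {name: text.encode(name) for name in names}

sample = "h\u00e9llo"  # "hello" with an acute accent on the e: 5 code points

code_points = [ord(ch) for ch in sample]
encodings = encode_all(sample)

print("code points:", code_points)
for name, raw in encodings.items():
    # UTF-8 spends 1 byte on ASCII letters and 2 on U+00E9 (6 bytes total);
    # UTF-16 spends 2 bytes per code point here (10 bytes);
    # UTF-32 always spends 4 bytes per code point (20 bytes).
    print(name, len(raw), raw.hex(" "))
```

The variable-length layout is where the "biased" part comes from: ASCII text costs one byte per character in UTF-8, while many other scripts cost two to four bytes per code point.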
replies(2): >>44377004 #>>44377091 #
hiddencost ◴[] No.44377091[source]
Well akshually...

I assume you started programming some time this millennium? That's the only way I can explain this "take".

replies(2): >>44377568 #>>44385622 #
1. roflcopter69 ◴[] No.44377568[source]
Care to elaborate?