←back to thread

71 points zobweyt | 2 comments | | HN request time: 0.445s | source
Show context
frizlab ◴[] No.43553870[source]
Great library!

Does it support non-English title casing?

For instance in French, title casing for “les maisons bleues” is “Les Maisons bleues” while for “des maisons bleues” it’s “Des maisons bleues”.

replies(1): >>43553910 #
zobweyt ◴[] No.43553910[source]
Thanks!

It does not support non-English title casing. From the documentation:

> It also works non-ascii characters. However, no inferences on the language itself is made. For instance, the digraph ij in Dutch will not be capitalized, because it is represented as two distinct Unicode characters. However, æ would be capitalized

replies(3): >>43554503 #>>43554857 #>>43555116 #
frizlab ◴[] No.43554857[source]
I was talking about the specific rules that are in place for title capitalization. As you can see in my example the uppercase letters seem randomly placed for a title, but they are indeed correct. For German too there are issues where capitalization has a meaning on the word itself. That kind of things.

It looks like your library does not support it, which is understandable, it is a huge problem to tackle, but I just wanted to be sure.

replies(1): >>43554915 #
1. zobweyt ◴[] No.43554915[source]
Thank you for the clarification! I understand that title capitalization can be quite complex, especially with specific rules in languages like German where capitalization can change the meaning of a word.

I guess handling these nuances falls under the broader categories of internationalization (i18n) and localization (l10n).

replies(1): >>43568972 #
2. frizlab ◴[] No.43568972[source]
Just to be excessively clear and maybe borderline annoying, this is not a simple nuance. In German the meaning of a word can actually change depending on its capitalization. Even in English, lowercasing the I is very weird.