From: duerst@... Date: 2018-10-08T10:25:34+00:00 Subject: [ruby-core:89312] [Ruby trunk Feature#14839] How to deal with capitalizing Georgian in Unicode 11.0.0 Issue #14839 has been updated by duerst (Martin D��rst). spixi (Marius Spix) wrote: > The current implementation of String.capitalize is not just a problem in Georgian, but also in other languages like Dutch. Words beginning with ���ij��� must be titlecased with a leading ���IJ���, e. g. ���IJsbeer��� (polar bear). This should be also considered when thinking about redesigning the case mapping code. Thanks for this information. The problem with this is that it is language-specific, i.e. it doesn't apply to all words starting with "ij" in all languages. Also, there's a character, '��', that correctly upcases to '��'. Unfortunately, it's not very much used in Dutch text. ---------------------------------------- Feature #14839: How to deal with capitalizing Georgian in Unicode 11.0.0 https://bugs.ruby-lang.org/issues/14839#change-74341 * Author: duerst (Martin D��rst) * Status: Feedback * Priority: Normal * Assignee: duerst (Martin D��rst) * Target version: ---------------------------------------- This is a request for feedback. In particular if you are from Georgia (the country, not the US state), or if you know somebody (who knows somebody,...) from Georgia, feedback on this issue is very much appreciated. If I don't get any feedback, I'll precede as explained below. Unicode 11.0.0 introduces an upper-case version of present-day Georgian letters called Mtavruli (the lower case letters are called Mkhedruli). Mtavruli letters are only used to empthasize whole words; there is no initial-letter capitalization in Georgian. Therefore, the Mkhedruli letters do not have Mtavruli letters as their titlecase, but are explicitly mapped to themselves. This means that in Ruby, `mkhedruli.capitalize` would be a no-op although `mkhedruli.upcase` would convert to Mtavruli letters. Additional pointers: http://www.unicode.org/versions/Unicode11.0.0/#Migration http://www.unicode.org/charts/PDF/Unicode-11.0/U110-1C90.pdf http://www.unicode.org/versions/Unicode11.0.0/ch07.pdf (Section 7.7, Georgian, pp. 320-321) -- https://bugs.ruby-lang.org/ Unsubscribe: