[#35446] [Ruby 1.9 - Bug #4477][Open] Kernel:exec and backtick (`) don't work for certain system commands — Joachim Wuttke <j.wuttke@...>

10 messages 2011/03/07

[#35476] [Ruby 1.9 - Bug #4489][Open] [PATCH] Encodings with /-(unix|dos|mac)\Z/ — "James M. Lawrence" <quixoticsycophant@...>

20 messages 2011/03/10

[#35552] [Ruby 1.9 - Feature #4523][Open] Kernel#require to return the path of the loaded file — Alex Young <alex@...>

14 messages 2011/03/24

[#35565] [Ruby 1.9 - Feature #4531][Open] [PATCH 0/7] use poll() instead of select() in certain cases — Eric Wong <normalperson@...>

33 messages 2011/03/28

[#35566] [Ruby 1.9 - Feature #4532][Open] [PATCH] add IO#pread and IO#pwrite methods — Eric Wong <normalperson@...>

12 messages 2011/03/28

[#35586] [Ruby 1.9 - Feature #4538][Open] [PATCH (cleanup)] avoid unnecessary select() calls before doing I/O — Eric Wong <normalperson@...>

9 messages 2011/03/29

[ruby-core:35541] Re: [Feature #2350](Rejected) Unicode specific functionality on String in 1.9

From: Cezary <cezary.baginski@...>
Date: 2011-03-22 17:30:04 UTC
List: ruby-core #35541
On Fri, Mar 18, 2011 at 09:52:27PM +0900, Nikolai Weibull wrote:
> On Fri, Mar 18, 2011 at 11:53, Magnus Holm <judofyr@gmail.com> wrote:
> > It's far easier for everyone that the built-in #upcase is
> > simple and fast and you'll have to be explicit about any
> > other I18n stuff IMO.
>
> Easy, perhaps, but hardly useful.

A agree - for human interaction it is completely useless. I tend to
think of #upcase as just a convenience method for dealing with ASCII
only system level functionality, e.g. paths on filesystems,
environment variables, html tags, (un)capitalizing to get class names,
database table names, etc.

Anything else is "no-op" or "undefined" for me.

> My point is that the current #upcase (and similar methods) is
> basically useless for anything other than ASCII.

I would probably go one step further and disallow upcase and friends
for any non-US-ASCII string for this reason. At least issue a warning.

> If this isn=E2=80=99t of interest, then I=E2=80=99m still looking for a w=
ay to
> override #upcase for Strings that use the UTF-8 encoding without
> resorting to alias_method or extend (as shown earlier in this
> discussion).  This seems impossible to do at the moment, as Encoding
> is a completely opaque object.

Correct me if I am wrong, but even "upper case" as a concept is not
common among all languages - an implementation detail for specific
cases at best.

For example, in German, you may want a more meaningful 'to_noun'
instead of 'capitalize'. For Japanese some may want upcase as a no-op
and some as a hack to convert to katakana. For case insensitivity,
probably a "normalize" method would be more descriptive.

Out of curiosity: in what specific case is utf upcase necessary?

--=20
Cezary Baginski

Attachments (1)

signature.asc (198 Bytes, application/pgp-signature)

In This Thread