[#11439] comments needed for Random class — "NAKAMURA, Hiroshi" <nakahiro@...>

-----BEGIN PGP SIGNED MESSAGE-----

15 messages 2007/06/12

[#11450] Re: new method dispatch rule (matz' proposal) — David Flanagan <david@...>

This is a late response to the very long thread that started back in

17 messages 2007/06/13

[#11482] Ruby Changes Its Mind About Non-Word Characters — James Edward Gray II <james@...>

Does this look like a bug to anyone else?

10 messages 2007/06/16

[#11505] Question about the patchlevel release cycle — Sylvain Joyeux <sylvain.joyeux@...4x.org>

1.8.6 thread support was broken in bad ways. It stayed for three months

20 messages 2007/06/20
[#11512] Re: Question about the patchlevel release cycle — Urabe Shyouhei <shyouhei@...> 2007/06/20

Hi, I'm the 1.8.6 branch manager.

[#11543] Re: Apple reportedly to ship with ruby 1.8.6-p36 unless informed what to patch — James Edward Gray II <james@...>

On Jun 27, 2007, at 4:47 PM, Bill Kelly wrote:

10 messages 2007/06/27

Re: Ruby Changes Its Mind About Non-Word Characters

From: "Vincent Isambart" <vincent.isambart@...>
Date: 2007-06-17 09:58:09 UTC
List: ruby-core #11494
> > It is because the and サ characters are not in ISO-8859-1.
>
> The characters were in UTF-8, which is why I asked iconv to convert
> them.  Don't take my word for it though, I gave you the source so you
> could double-check me.

Sorry, my mistake, ォ and サ are indeed in ISO 8859-1. It's just that
they were displayed as question mark which is the default character
used by iconv when a characters is not in the destination encoding.
But here it seems it's the terminal that displays it, not iconv.

> > The ? character has the same code in both UTF-8 and ISO-8859-1 as it
> > is an ASCII character...
>
> My text editor and terminal don't agree
> Firefly:~/Desktop$ cat saved_as_utf8.txt saved_as_latin1.txt
> ォサ
> ??
> Firefly:~/Desktop$ ruby -e 'p ARGF.read' saved_as_utf8.txt
> saved_as_latin1.txt
> "\302\253\302\273\n\253\273\n"

In fact nobody is wrong. I was talking about the question mark (?)
character that is indeed an ASCII character and so the has the same
code in both UTF-8 and ISO 8859-1.
The "question mark" you are refering to is not a question mark but an
invalid UTF-8 character that is displayed by the terminal as a
question mark.

> I'd view ォ and サ as the same kind of punctuation as ". So it's not a
> letter. And \b should match at letter <> non-letter boundaries.

I agree that the ォ and サ are logically quotes (after all I'm French
and they are used a lot in French), but what I said is that the
Unicode support for Ruby 1.8 is poor (in fact except split(//) you
can't do much) so I am not that suprised by the fact it does not work.


In This Thread