[#11890] Ruby and Solaris door library — "Hiro Asari" <asari.ruby@...>

Hi, there. This is my first patch against ruby. I think I followed

19 messages 2007/08/13
[#11892] Re: Ruby and Solaris door library — Daniel Berger <djberg96@...> 2007/08/14

Hiro Asari wrote:

[#11899] pack/unpack 64bit Integers — Hadmut Danisch <hadmut@...>

Hi,

13 messages 2007/08/14
[#11903] Re: pack/unpack 64bit Integers — Brian Candler <B.Candler@...> 2007/08/15

On Wed, Aug 15, 2007 at 06:50:01AM +0900, Hadmut Danisch wrote:

[#11948] Fibers in Ruby 1.9? — David Flanagan <david@...>

I just noticed that my ruby1.9 build of August 17th includes a Fiber

22 messages 2007/08/22
[#11949] Re: Fibers in Ruby 1.9? — Daniel Berger <djberg96@...> 2007/08/22

David Flanagan wrote:

[#11950] Re: Fibers in Ruby 1.9? — "Francis Cianfrocca" <garbagecat10@...> 2007/08/22

On 8/22/07, Daniel Berger <djberg96@gmail.com> wrote:

[#11952] Re: Fibers in Ruby 1.9? — MenTaLguY <mental@...> 2007/08/22

On Wed, 22 Aug 2007 20:50:12 +0900, "Francis Cianfrocca" <garbagecat10@gmail.com> wrote:

[#11988] String#length not working properly in Ruby 1.9 — "Vincent Isambart" <vincent.isambart@...>

I saw that Matz just merged his M17N implementation in the trunk.

17 messages 2007/08/25
[#11991] Re: String#length not working properly in Ruby 1.9 — "Michael Neumann" <mneumann@...> 2007/08/25

On Sat, 25 Aug 2007 10:54:20 +0200, Yukihiro Matsumoto

[#11992] Re: String#length not working properly in Ruby 1.9 — Yukihiro Matsumoto <matz@...> 2007/08/25

Hi,

[#12042] Encodings of string literals; explicit codepoint escapes? — David Flanagan <david@...>

This message contains queries that probably only Matz can answer:

16 messages 2007/08/31
[#12043] Re: Encodings of string literals; explicit codepoint escapes? — Yukihiro Matsumoto <matz@...> 2007/08/31

Hi,

Encodings of string literals; explicit codepoint escapes?

From: David Flanagan <david@...>
Date: 2007-08-31 06:52:51 UTC
List: ruby-core #12042
This message contains queries that probably only Matz can answer:

I'm excited to see that strings have encodings now!  Thank you for your 
Unicode support!  I have a few questions:

1) I gather that string literals are given the encoding specified by the 
-K option or by the encoding comment at the top of the file.  Do you 
plan any changes to the string literal syntax so that encodings can be 
specified for individual literals?  Will I be able to include a utf-8 
encoding string literal within a file that is otherwise in ASCII? I 
don't like Python's u"" syntax, but I'm hoping that you'll provide some 
more elegant alternative.

2) This is really part of the same question: will you extend the string 
literal syntax to allow the inclusion of arbitrary codepoints in 
ASCII-encoded files using some kind of character escape?  I'm accustomed 
to Java's \uxxxx escape sequence and would like to see something like 
this.  (I don't know enough about SJIS and EUC to know if that would be 
relevant to those encodings or not.)

Despite my relative ignorance, I suggest something along these lines:

\uxxxx: represents Unicode codepoint U+xxxx
\Uxxxxxx: represents Unicode codepoint U+xxxxxx
\Exxxx: represents EUC codepoint xxxx
\Sxxxx: repersents SJIS codepoint xxxx

xxxx: is a string of four hex digits.

A string may not mix codepoints from different encodings.

If a string contains a codepoint escape, it gets its encoding from that 
escape.

If a string literal ends with \u, \U, \E, or \S (with no hex digits 
following) then the escape specifies the encoding of the string, even 
when the string does not contain any characters outside of the ASCII subset.

     David

In This Thread

Prev Next