[#11852] continuations in Ruby 1.9? — David Flanagan <david@...>
In a comment on my recent blog post
On 8/6/07, David Flanagan <david@davidflanagan.com> wrote:
[#11860] Is this really what we want? — James Edward Gray II <james@...>
I'm investigating some recent breakage in FasterCSV and have tracking
Hi,
[#11871] ruby-openssl: == incorrect for X509-Subjects — Hadmut Danisch <hadmut@...>
Hi,
[#11876] priorities of newly-created threads — David Flanagan <david@...>
Hi,
Hi,
[#11886] Core dump with simple web scraper when run via cron — Daniel Berger <Daniel.Berger@...>
Hi all,
[#11890] Ruby and Solaris door library — "Hiro Asari" <asari.ruby@...>
Hi, there. This is my first patch against ruby. I think I followed
Hiro Asari wrote:
On 8/13/07, Daniel Berger <djberg96@gmail.com> wrote:
> -----Original Message-----
On 8/15/07, Berger, Daniel <Daniel.Berger@qwest.com> wrote:
[#11893] UDP sockets raise exception on MIPS platform — Brian Candler <B.Candler@...>
I am running ruby-1.8.6 under OpenWrt (*), which is a small MIPS platform
[#11894] IO#seek and whence problem — Bertram Scharpf <lists@...>
[#11899] pack/unpack 64bit Integers — Hadmut Danisch <hadmut@...>
Hi,
On Wed, Aug 15, 2007 at 06:50:01AM +0900, Hadmut Danisch wrote:
On Wed, Aug 15, 2007 at 02:45:20PM +0900, Brian Candler wrote:
On Fri, Aug 17, 2007 at 05:17:09PM +0200, Hadmut Danisch wrote:
Dumb question of the day: are Kernel#proc and Kernel#lambda identical?
> Dumb question of the day: are Kernel#proc and Kernel#lambda identical?
[#11900] missing bison, gperf not detected, do I need ruby to build ruby? — "Gabor Szabo" <szabgab@...>
Hi,
On Wed, 15 Aug 2007, Gabor Szabo wrote:
> > It seems ./configure did not detect the fact that bison was missing from
[#11930] Bug in select? — "Robert Dober" <robert.dober@...>
Hi
[#11945] Smoke testing Ruby — "Gabor Szabo" <szabgab@...>
Hi,
On 8/21/07, Gabor Szabo <szabgab@gmail.com> wrote:
Ruby used to have the Triple-R project based on Rubicon: see
Hugh Sasse wrote:
[#11947] Splatting MatchData bug? — Jos Backus <jos@...>
$ /tmp/ruby-1.9/bin/ruby -v
[#11948] Fibers in Ruby 1.9? — David Flanagan <david@...>
I just noticed that my ruby1.9 build of August 17th includes a Fiber
David Flanagan wrote:
On 8/22/07, Daniel Berger <djberg96@gmail.com> wrote:
On Wed, 22 Aug 2007 20:50:12 +0900, "Francis Cianfrocca" <garbagecat10@gmail.com> wrote:
On 8/22/07, MenTaLguY <mental@rydia.net> wrote:
On Thu, 23 Aug 2007 00:57:01 +0900, "Francis Cianfrocca" <garbagecat10@gmail.com> wrote:
[#11960] coroutines with Fiber::Core — David Flanagan <david@...>
The following code works on Linux with today's snapshot of 1.9:
Hi,
[#11981] Inverse Square Root — "Dave Pederson" <dave.pederson@...>
Hello-
[#11988] String#length not working properly in Ruby 1.9 — "Vincent Isambart" <vincent.isambart@...>
I saw that Matz just merged his M17N implementation in the trunk.
Hi,
On Sat, 25 Aug 2007 10:54:20 +0200, Yukihiro Matsumoto
Hi,
On 8/25/07, Yukihiro Matsumoto <matz@ruby-lang.org> wrote:
Hi,
On 8/25/07, Yukihiro Matsumoto <matz@ruby-lang.org> wrote:
[#12025] how to build ruby on vms — "toni" <toni@...>
Hi,
[#12040] Pragmas in Ruby 1.9 — David Flanagan <david@...>
Hi,
[#12042] Encodings of string literals; explicit codepoint escapes? — David Flanagan <david@...>
This message contains queries that probably only Matz can answer:
Hi,
Yukihiro Matsumoto wrote:
On 8/31/07, Yukihiro Matsumoto <matz@ruby-lang.org> wrote:
Re: Encodings of string literals; explicit codepoint escapes?
For what it's worth, I don't think that the \N{name} escape is
necessary, even in the standard library. Unicode names are so long
(like "ARABIC LIGATURE SALLALLAHOU ALAYHE WASALLAM") that I think this
notation would be terribly cumbersome.
Also, \N escapes could easily be approximated (at runtime, instead of
compile time) with #{} interpolation. Define a class UN (for Unicode
Name) and give it a const_missing method to look up (and define
permanently) codepoints by name. Then you can write strings like
"#{UN.COPYRIGHT_SIGN} 2007". Which isn't much longer or harder than
"\N{COPYRIGHT SIGN} 2007"
I've done something similar (for codepoints instead of names) with
const_missing here: http://www.davidflanagan.com/blog/2007_08.html#000136
David
Yukihiro Matsumoto wrote:
> Hi,
>
> In message "Re: Encodings of string literals; explicit codepoint escapes?"
> on Fri, 31 Aug 2007 15:52:51 +0900, David Flanagan <david@davidflanagan.com> writes:
>
> |I'm excited to see that strings have encodings now! Thank you for your
> |Unicode support! I have a few questions:
> |
> |1) I gather that string literals are given the encoding specified by the
> |-K option or by the encoding comment at the top of the file. Do you
> |plan any changes to the string literal syntax so that encodings can be
> |specified for individual literals? Will I be able to include a utf-8
> |encoding string literal within a file that is otherwise in ASCII? I
> |don't like Python's u"" syntax, but I'm hoping that you'll provide some
> |more elegant alternative.
>
> We will provide "binary" string literals probably via b"" or ""b (not
> fixed yet). If you want to have string encoded in utf-8 in ASCII
> coded script, you can have utf-8 binary string in binary then specify
> utf-8 later, e.g.
>
> # my last name in Japanese
> m = b"\343\201\276\343\201\244\343\202\202\343\201\250"
> m.encoding="utf8"
>
> or possible alternative in the distant future may be:
>
> m = "\343\201\276\343\201\244\343\202\202\343\201\250".utf8
> m = "\343\201\276\343\201\244\343\202\202\343\201\250"u
> m = "\343\201\276\343\201\244\343\202\202\343\201\250"e:utf8
>
> |2) This is really part of the same question: will you extend the string
> |literal syntax to allow the inclusion of arbitrary codepoints in
> |ASCII-encoded files using some kind of character escape? I'm accustomed
> |to Java's \uxxxx escape sequence and would like to see something like
> |this. (I don't know enough about SJIS and EUC to know if that would be
> |relevant to those encodings or not.)
> |
> |Despite my relative ignorance, I suggest something along these lines:
> |
> |\uxxxx: represents Unicode codepoint U+xxxx
> |\Uxxxxxx: represents Unicode codepoint U+xxxxxx
> |\Exxxx: represents EUC codepoint xxxx
> |\Sxxxx: repersents SJIS codepoint xxxx
> |
> |xxxx: is a string of four hex digits.
>
> We just had a meeting to discuss about issues like this yesterday.
> And the end result was
>
> \xXX -> single byte
> \uXXXX -> single Unicode character by codepoint (BMP)
> \u{XXXXXXXX} -> single Unicode character up to 4 bytes
> \N{name} -> single character by name
>
> But you need to require additional library to get:
>
> * characters from Unicode name
> * Unicode character embedded in non-Unicode encoding strings
>
> |If a string literal ends with \u, \U, \E, or \S (with no hex digits
> |following) then the escape specifies the encoding of the string, even
> |when the string does not contain any characters outside of the ASCII subset.
>
> This is an interesting idea. We haven't made a way to specify
> encoding of literals yet. This might be an input for inspiration.
>
> matz.
>