[#4065] Surprise in Time#sec — Steven Jenkins <steven.jenkins@...>
This bit me:
[#4067] Segfault in Thread#initialize / caller — Florian Gro<florgro@...>
Moin!
[#4076] Ruby/DL — Jamis Buck <jamis_buck@...>
I recently used Ruby/DL to create bindings to the SQLite3 embedded
On Tue, Jan 04, 2005 at 02:53:49AM +0900, Jamis Buck wrote:
>>>>> "P" == Paul Brannan <pbrannan@atdesk.com> writes:
On Wed, Jan 05, 2005 at 03:05:48AM +0900, ts wrote:
>>>>> "P" == Paul Brannan <pbrannan@atdesk.com> writes:
On Thu, Jan 06, 2005 at 01:10:34AM +0900, ts wrote:
>>>>> "P" == Paul Brannan <pbrannan@atdesk.com> writes:
On Thu, Jan 06, 2005 at 06:57:57PM +0900, ts wrote:
>>>>> "P" == Paul Brannan <pbrannan@atdesk.com> writes:
On Fri, Jan 07, 2005 at 12:06:16AM +0900, ts wrote:
>>>>> "P" == Paul Brannan <pbrannan@atdesk.com> writes:
ts wrote:
[#4116] Test::Unit::Collector::Dir won't work with code that modifies $LOAD_PATH — Eric Hodel <drbrain@...7.net>
Any test code that depends upon modifications of $: fails when used
Hi,
On 11 Jan 2005, at 04:14, nobu.nokada@softhome.net wrote:
On 11 Jan 2005, at 09:39, Eric Hodel wrote:
On Sat, 15 Jan 2005 04:06:10 +0900, Eric Hodel <drbrain@segment7.net> wrote:
On Fri, 14 Jan 2005 23:48:58 -0500, Nathaniel Talbott
On Thu, 27 Jan 2005 17:17:14 -0500, Nathaniel Talbott
[#4146] The face of Unicode support in the future — Charles O Nutter <headius@...>
Hello Rubyists!
Hi,
Yukihiro Matsumoto <matz@ruby-lang.org> writes:
Paul Brannan <pbrannan@atdesk.com> writes:
Hi,
On Mon, Jan 10, 2005 at 11:53:48PM +0900, Yukihiro Matsumoto wrote:
Hi,
Yukihiro Matsumoto wrote:
Hi,
On Wed, Jan 12, 2005 at 02:13:35PM +0900, Yukihiro Matsumoto wrote:
Hi,
[#4189] Authenticated proxy support for open-uri — Neil Kohl <nakohl@...>
Hello!
[#4232] Carriage return on shebang — Florian Gro<florgro@...>
Moin.
[#4242] tracer.rb: Do not list pseudo source lines of binary extensions — Florian Gro<florgro@...>
Moin.
[#4243] Patch that enables https in open-uri.rb — Michael Neumann <mneumann@...>
Hi,
In article <41E93F42.9090705@ntecs.de>,
Tanaka Akira wrote:
[#4269] Re: The face of Unicode support in the future — Wes Nakamura <wknaka@...>
Hi,
Hi,
Yukihiro Matsumoto wrote:
Hi,
[#4296] parse_c.rb: allow whitespace after function names — Tilman Sauerbeck <tilman@...>
Hi,
Hi,
Yukihiro Matsumoto <matz@ruby-lang.org> [2005-01-21 17:43]:
[#4311] RFE: Enumerable#group_by, Array#^ — Florian Gro<florgro@...>
Moin.
[#4323] test/unit doesn't rescue a Exception — Tanaka Akira <akr@...17n.org>
test/unit doesn't rescue a Exception in a test method, as follows.
In article <87is5jb46q.fsf@serein.a02.aist.go.jp>,
On 9/1/06, Tanaka Akira <akr@fsij.org> wrote:
On Sep 2, 2006, at 6:34 PM, Nathaniel Talbott wrote:
In article <A604C0B3-95ED-4B9B-866C-79A2C7D5E3C4@segment7.net>,
On Sep 2, 2006, at 9:39 PM, Tanaka Akira wrote:
In article <622DAC7E-55DB-4854-B82B-A037CE9C75EF@segment7.net>,
In article <87ac5hv4bo.fsf@fsij.org>,
On Sep 3, 2006, at 8:21 AM, Tanaka Akira wrote:
[#4332] IO#clearerr missing in action — Eric Hodel <drbrain@...7.net>
I wanted to implement tail(1) in ruby cleanly, but found the best I
[#4335] When will Object#type disappear? — "David A. Black" <dblack@...>
Hi --
Re: The face of Unicode support in the future
Note: this message contains UTF-8 characters.
On Thu, 13 Jan 2005 07:56:22 +0900, Yukihiro Matsumoto
<matz@ruby-lang.org> wrote:
> In message "Re: The face of Unicode support in the future" on Thu,
> 13 Jan 2005 01:35:36 +0900, Christian Neukirchen
> <chneukirchen@gmail.com > writes:
>| This sounds likely to result in duplicated efforts... Do it
>| pragmatically; I don't think it should be very hard to provide a
>| default Character class that people can "customize" by
>| subclassing or method redefinition.
> A character might be represented by either:
> * code point
> * sequence of code points
> * or even set of attributes, without any code point
> * or something totally different
> But never mind. I'm no expert. I just don't want to repeat the
> argument again in English.
I'm not going to claim to be an expert, but in my work I have had to
look this over extensively -- mostly from the perspective of needing
to support Unicode, but also from the perspective of dealing with
unknown code pages. I'm not really understanding the last two points
in your list (a set of attributes, without any code point; something
totally different).
In Unicode encodings, at least, a code point can be multiple bytes
(or multiple words in UTF-16 with surrogates) -- and a character can
be multiple code points for the combining characters (e.g., can be
represented as combining-' + e, but there are explicit rules for how
combining characters can be specified in order).
It's important to note that in filenames, at least, Windows will
represent most characters as composed (e.g., and the Mac will
represent most characters as decomposed (e.g., 'e). These are,
however, the same Character. So when working with Unicode strings,
If I have
a = "r駸um蘂
then when I do:
a[1]
I expect to get a Character of "蘂. With this Character, I should be
able to extract the composed codepoint(s) as well as the decomposed
codepoint(s) -- there are regular transformations available in
Unicode for this matter.
I do not necessarily expect to be able to do these transformations
on normal Strings -- but I do expect to be able to do these
transformations on I18N/M17N Ruby with multiple String encoding
support. Whether it's as a Character class or something that I apply
to a String, I don't particularly care.
I fully expect that you'll have a sensible API for this when you get
it added, but I would like such standard handling -- and a standard
way of adding new handling -- in Ruby 2.0.
Have you looked at ICUC (IBM's International Components for Unicode
for C)? It uses a UTF-16 encoding internally, but it supports quite
a bit of what I'm talking about already.
-austin
--
Austin Ziegler * halostatue@gmail.com
* Alternate: austin@halostatue.ca