[#63592] [ruby-trunk - Bug #10009] IO operation is 10x slower in multi-thread environment — normalperson@...
Issue #10009 has been updated by Eric Wong.
3 messages
2014/07/08
[#63682] [ruby-trunk - Feature #10030] [PATCH] reduce rb_iseq_struct to 296 bytes — ko1@...
Issue #10030 has been updated by Koichi Sasada.
3 messages
2014/07/13
[#63703] [ruby-trunk - Feature #10030] [PATCH] reduce rb_iseq_struct to 296 bytes — ko1@...
Issue #10030 has been updated by Koichi Sasada.
3 messages
2014/07/14
[#63743] [ruby-trunk - Bug #10037] Since r46798 on Solaris, "[BUG] rb_vm_get_cref: unreachable" during make — ngotogenome@...
Issue #10037 has been updated by Naohisa Goto.
3 messages
2014/07/15
[#64136] Ruby 2.1.2 (and 2.1.1 and probably others) assumes a libffi with 3 version numbers in extconf.rb — "Jeffrey 'jf' Lim" <jfs.world@...>
As per subject.
4 messages
2014/07/31
[#64138] Re: Ruby 2.1.2 (and 2.1.1 and probably others) assumes a libffi with 3 version numbers in extconf.rb
— "Jeffrey 'jf' Lim" <jfs.world@...>
2014/07/31
On Thu, Jul 31, 2014 at 6:03 PM, Jeffrey 'jf' Lim <jfs.world@gmail.com>
[ruby-core:64038] [CommonRuby - Feature #10084] Add Unicode String Normalization to String class
From:
matz@...
Date:
2014-07-26 06:18:00 UTC
List:
ruby-core #64038
Issue #10084 has been updated by Yukihiro Matsumoto.
First of all, I don't think `normalize` is the best name.
I propose `unicode_normalize` instead, since this normalization is sort of unicode specific.
We still need to define the detail.
It should raise an exception for non Unicode strings. It shouldn't convert to UTF-8 implicitly inside.
Matz.
----------------------------------------
Feature #10084: Add Unicode String Normalization to String class
https://bugs.ruby-lang.org/issues/10084#change-48051
* Author: Martin Dテシrst
* Status: Open
* Priority: Normal
* Assignee:
* Category:
* Target version:
----------------------------------------
Unicode string normalization is a frequent operation when comparing or normalizing strings.
This should be available directly on the String class.
The proposed syntax is:
'string'.normalize # normalize 'string' according to NFC (most frequent on the Web)
'string'.normalize :nfc # normalize 'string' according to NFC; :nfd, :nfkc, :nfkd also usable
'string'.nfc # shorter variant, but maybe too many methods
There are several "unofficial" but convenient normalization variants that could be offered, e.g.:
'string'.normalize :mac # use MacIntosh file system normalization variant
Implementations are already available in pure Ruby (easy for other Ruby implementations; e.g. eprun: https://github.com/duerst/eprun) and in C (unf,窶ヲ, http://bibwild.wordpress.com/2013/11/19/benchmarking-ruby-unicode-normalization-alternatives/)
---Files--------------------------------
Normalization.pdf (576 KB)
--
https://bugs.ruby-lang.org/