[#50466] [ruby-trunk - Bug #7492][Open] Segmentation fault at DL::TestDL#test_call_double on x64 Windows 8 — "phasis68 (Heesob Park)" <phasis@...>

23 messages 2012/12/02

[#50558] [ruby-trunk - Feature #7511][Open] short-circuiting logical implication operator — "rits (First Last)" <redmine@...>

12 messages 2012/12/04

[#50575] [ruby-trunk - Feature #7517][Open] Fixnum::MIN,MAX — "matz (Yukihiro Matsumoto)" <matz@...>

20 messages 2012/12/05

[#50755] Becoming a committer — Charlie Somerville <charlie@...>

Hi ruby-core,

21 messages 2012/12/11
[#50759] Re: Becoming a committer — Yukihiro Matsumoto <matz@...> 2012/12/11

Hi,

[#50784] Re: Becoming a committer — Charles Oliver Nutter <headius@...> 2012/12/11

It's really this easy? If so, I'll send over my public key today :)

[#50795] Re: Becoming a committer — Yukihiro Matsumoto <matz@...> 2012/12/11

Hi,

[#50806] [ruby-trunk - Feature #7548][Open] Load and Require Callbacks — "trans (Thomas Sawyer)" <transfire@...>

12 messages 2012/12/12

[#50810] [ruby-trunk - Feature #7549][Open] A Ruby Design Process — "brixen (Brian Ford)" <brixen@...>

34 messages 2012/12/12

[#50867] [ruby-trunk - Bug #7556][Assigned] test error on refinement — "usa (Usaku NAKAMURA)" <usa@...>

14 messages 2012/12/13

[#50900] [ruby-trunk - Bug #7564][Open] r38175 introduces incompatibility — "tenderlovemaking (Aaron Patterson)" <aaron@...>

14 messages 2012/12/14

[#50951] [ruby-trunk - Bug #7584][Open] Ruby hangs when shutting down an ssl connection in gc finalization — "bpot (Bob Potter)" <bobby.potter@...>

12 messages 2012/12/17

[#51076] [ruby-trunk - Feature #7604][Open] Make === comparison operator ability to delegate comparison to an argument — "prijutme4ty (Ilya Vorontsov)" <prijutme4ty@...>

12 messages 2012/12/22

[#51170] [ruby-trunk - Bug #7629][Open] Segmentation fault — "atd (Antonio Tapiador)" <atapiador@...>

13 messages 2012/12/28

[ruby-core:50537] [ruby-trunk - Bug #7501][Rejected] \w in a regular expression doesn't match international characters

From: "shyouhei (Shyouhei Urabe)" <shyouhei@...>
Date: 2012-12-03 18:43:41 UTC
List: ruby-core #50537
Issue #7501 has been updated by shyouhei (Shyouhei Urabe).

Status changed from Open to Rejected

If I remember correctly this is an intentional design.  Because as Unicode version grows, the definition of what is a word character and what is not changes form time to time.  It is hard for us to follow that.
----------------------------------------
Bug #7501: \w in a regular expression doesn't match international characters
https://bugs.ruby-lang.org/issues/7501#change-34380

Author: eltomito (Tomas Partl)
Status: Rejected
Priority: Normal
Assignee: 
Category: core
Target version: 
ruby -v: ruby 1.9.3p0 (2011-10-30 revision 33570) [i686-linux]


When using regexp matching, \w doesn't match characters which are not in the English alphabet.
For example, the characters "žščřďťňaáéíóůúý" should all be matched by \w but aren't.

This program demonstrates the bug:

--------------------------------------------------------
# encoding: utf-8
match = /\w+/.match( "abcdefghijklmnopqrstuvwxyz" )
puts match.to_s
	
match = /\w+/.match( "áéíóůúýžščřďťň" ) #some Czech characters
puts match.to_s

match = /\w+/.match( "üäö" )	#some German characters
puts match.to_s
----------------------------------------------------------

Expected output:
----------------------------------------------------------
abcdefghijklmnopqrstuvwxyz
áéíóůúýžščřďťň
üäö
----------------------------------------------------------
Actual output:
----------------------------------------------------------
abcdefghijklmnopqrstuvwxyz


----------------------------------------------------------



-- 
http://bugs.ruby-lang.org/

In This Thread

Prev Next