From: naruse@... Date: 2016-01-07T05:40:21+00:00 Subject: [ruby-core:72749] [Ruby trunk - Bug #4044] Regex matching errors when using \W character class and /i option Issue #4044 has been updated by Yui NARUSE. Sam Eaton wrote: > /[\W]/ =~ "00FF00" # nil > > /[\W]/i =~ "00FF00" # 2 It's spec. Its mechanism is, \W includes U+FB00 (LATIN SMALL LIGATURE FF). /i option expands it into FF. The the "FF" match given string. ---------------------------------------- Bug #4044: Regex matching errors when using \W character class and /i option https://bugs.ruby-lang.org/issues/4044#change-55994 * Author: Ben Hoskings * Status: Closed * Priority: Normal * Assignee: Yui NARUSE * ruby -v: ruby 1.9.2p0 (2010-08-18 revision 29036) [x86_64-darwin10.4.0] * Backport: ---------------------------------------- =begin Hi all, Josh Bassett and I just discovered an issue with regex matches on ruby-1.9.2p0. (We reduced it while we were hacking on gemcutter.) The case-insensitive (/i) option together with the non-word character class (\W) match inconsistently against the alphabet. Specifically the regex doesn't match properly against the letters 'k' and 's'. The following expression demonstrates the problem in irb: puts ('a'..'z').to_a.map {|c| [c, c.ord, c[/[^\W]/i] ].inspect } As a reference, the following two expressions are working properly: puts ('a'..'z').to_a.map {|c| [c, c.ord, c[/[^\W]/] ].inspect } puts ('a'..'z').to_a.map {|c| [c, c.ord, c[/[\w]/i] ].inspect } Cheers Ben Hoskings & Josh Bassett =end -- https://bugs.ruby-lang.org/ Unsubscribe: