From: nobu@... Date: 2015-12-17T01:36:24+00:00 Subject: [ruby-core:72199] [Ruby trunk - Bug #11825] MatchData#names returns ASCII-8BIT Issue #11825 has been updated by Nobuyoshi Nakada. Description updated Backport changed from 2.0.0: UNKNOWN, 2.1: UNKNOWN, 2.2: UNKNOWN to 2.0.0: REQUIRED, 2.1: REQUIRED, 2.2: REQUIRED ---------------------------------------- Bug #11825: MatchData#names returns ASCII-8BIT https://bugs.ruby-lang.org/issues/11825#change-55611 * Author: Toru Iwase * Status: Closed * Priority: Normal * Assignee: * ruby -v: ruby 2.3.0preview2 (2015-12-11 trunk 53028) [x86_64-linux] * Backport: 2.0.0: REQUIRED, 2.1: REQUIRED, 2.2: REQUIRED ---------------------------------------- `MatchData#names` returns ASCII-8BIT. But `MatchData#[]` doesn't accept ASCII-8BIT name if name has non-ASCII chars. `MatchData#names` should return same encoding of regexp. (UTF-8 by default) ~~~ $ irb ruby 2.3.0preview2 (2015-12-11 trunk 53028) [x86_64-linux] irb(main):001:0> m = /(?<������>���������)/.match('���������') => # irb(main):002:0> m.names => ["\xE5\xAE\x9D\xE7\x9F\xB3"] irb(main):003:0> m[m.names[0]] IndexError: undefined group name reference: "\xE5\xAE\x9D\xE7\x9F\xB3" from (irb):3:in `[]' from (irb):3 from /usr/local/anyenv/envs/rbenv/versions/2.3.0-preview2/bin/irb:11:in `
' irb(main):004:0> m['������'] => "���������" irb(main):005:0> m.names[0].encoding => # irb(main):006:0> m = /(?���������)/.match('���������') => # irb(main):007:0> m.names => ["gem"] irb(main):008:0> m.names[0].encoding => # irb(main):009:0> m[m.names[0]] => "���������" ~~~ -- https://bugs.ruby-lang.org/