From: Yui NARUSE Date: 2012-01-07T19:49:08+09:00 Subject: [ruby-core:41956] [ruby-trunk - Bug #5855][Rejected] inconsistent treatment of 8 bit characters in US-ASCII Issue #5855 has been updated by Yui NARUSE. Status changed from Open to Rejected U+0080 of Unicode can't be mapped to 0x80 of US-ASCII. In US-ASCII, the codepoint 0x80 exists, but doesn't define any character. ---------------------------------------- Bug #5855: inconsistent treatment of 8 bit characters in US-ASCII https://bugs.ruby-lang.org/issues/5855 Author: John Firebaugh Status: Rejected Priority: Normal Assignee: Category: Target version: ruby -v: ruby 1.9.3p0 (2011-10-30 revision 33570) [x86_64-darwin10.8.0] =begin Does Ruby allow 8 bit characters (127-255) in a US-ASCII encoded string, or not? "\u{80}".encode("US-ASCII") #=> Encoding::UndefinedConversionError 0x80.chr("US-ASCII") #=> "\x80" (US-ASCII encoding) "".encode("US-ASCII") << 128 #=> "\x80" (US-ASCII encoding) "".encode("US-ASCII") << 128.chr #=> "\x80" (ASCII-8BIT encoding) =end -- http://redmine.ruby-lang.org