From: Benoit Daloze <redmine@...>
Date: 2010-06-12T19:54:46+09:00
Subject: [ruby-core:30751] [Bug #3432] bad conversion from UTF8-MAC

Bug #3432: bad conversion from UTF8-MAC
http://redmine.ruby-lang.org/issues/show/3432

Author: Benoit Daloze
Status: Open, Priority: Normal
Category: M17N, Target version: 1.9.x
ruby -v: ruby 1.9.3dev (2010-06-09) [x86_64-darwin10.3.0]

Ruby does not convert well from UTF8-MAC to UTF-8

The attached script shows it is the only encoding to behave that way and which is compatible '��' or '��' in UTF-8.

The results is:
UTF8-MAC Backward conversion: not the same bytes:
	[169, 0, 116, 168, 0] != [195, 169, 116, 195, 168]
UTF8-MAC Backward conversion: not the same chars:
	["\xA9", "\u0000", "t", "\xA8", "\u0000"] != ["��", "t", "��"]

This issue prevent, without Iconv, from converting properly UTF8-MAC String (from HFS paths) to UTF-8


----------------------------------------
http://redmine.ruby-lang.org