From: thomas@... Date: 2014-04-08T12:12:52+00:00 Subject: [ruby-core:61908] [ruby-trunk - Bug #9712] Dir.entries replace Unicode character with questionmarks Issue #9712 has been updated by Thomas Thomassen. Additional info: I've made the RB files have `# encoding: UTF-8` and set `-E UTF-8:UTF-8` which from my understanding of the documentation should affect the encoding returned by `Dir`: ~~~ * call-seq: * Encoding.default_internal -> enc * * Returns default internal encoding. Strings will be transcoded to the * default internal encoding in the following places if the default internal * encoding is not nil: * * * File names from Dir ~~~ But I'm not seeing this behaviour. ---------------------------------------- Bug #9712: Dir.entries replace Unicode character with questionmarks https://bugs.ruby-lang.org/issues/9712#change-46113 * Author: Thomas Thomassen * Status: Assigned * Priority: Normal * Assignee: Zachary Scott * Category: doc * Target version: current: 2.2.0 * ruby -v: ruby 2.2.0dev (2014-04-07 trunk 45528) [i386-mswin32_100] * Backport: 2.0.0: UNKNOWN, 2.1: UNKNOWN ---------------------------------------- My basis when testing this is that I have a computer with English OS - codepage Windows-1252. The tests might yield different result if the Windows codepage is different - so please pay attention to that if you are unable to reproduce. Given a folder named "Foo" which contains a sub-folder "���������" ("\u3066\u3059\u3068") Dir.entries("Foo") will return: [".", "..", "???"] The characters that doesn't fit my filesystem codepage is translated into question marks. I would have expected the strings returned to be in some Unicode format. -- https://bugs.ruby-lang.org/