[#33000] [Ruby 1.9-Bug#4014][Open] Case-Sensitivity of Property Names Depends on Regexp Encoding — Run Paint Run Run <redmine@...>

Bug #4014: Case-Sensitivity of Property Names Depends on Regexp Encoding

11 messages 2010/11/01

[#33021] Re: [Ruby 1.9-Feature#4015][Open] File::DIRECT Constant for O_DIRECT — Yukihiro Matsumoto <matz@...>

Hi,

15 messages 2010/11/02

[#33139] [Ruby 1.9-Bug#4044][Open] Regex matching errors when using \W character class and /i option — Ben Hoskings <redmine@...>

Bug #4044: Regex matching errors when using \W character class and /i option

8 messages 2010/11/11

[#33162] Windows Unicode (chcp 65001) Generates incorrect output — Luis Lavena <luislavena@...>

Hello,

10 messages 2010/11/14

[#33246] [Ruby 1.9-Feature#4068][Open] Replace current standard Date/DateTime library with home_run — Jeremy Evans <redmine@...>

Feature #4068: Replace current standard Date/DateTime library with home_run

40 messages 2010/11/17

[#33255] [Ruby 1.9-Feature#4071][Open] support basic auth for Net::HTTP.get requests — "coderrr ." <redmine@...>

Feature #4071: support basic auth for Net::HTTP.get requests

23 messages 2010/11/19

[#33322] [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <redmine@...>

Feature #4085: Refinements and nested methods

94 messages 2010/11/24
[#33345] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/11/25

Hi,

[#33356] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/11/25

Hi,

[#33375] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/11/25

Hi,

[#33381] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/11/25

Hi,

[#33387] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Magnus Holm <judofyr@...> 2010/11/25

Woah, this is very nice stuff! Some comments/questions:

[#33487] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Charles Oliver Nutter <headius@...> 2010/11/30

This is a long response, and for that I apologize. I want to make sure

[#33535] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/12/03

Hi,

[#33519] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/12/02

Hi,

[#33523] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/12/02

Hi,

[#33539] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/12/03

Hi,

[#33543] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/12/03

Hi,

[#33546] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/12/03

Hi,

[#33548] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Yusuke ENDOH <mame@...> 2010/12/03

Hi,

[#33567] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Shugo Maeda <shugo@...> 2010/12/04

Hi,

[#33595] Re: [Ruby 1.9-Feature#4085][Open] Refinements and nested methods — Charles Oliver Nutter <headius@...> 2010/12/06

On Sat, Dec 4, 2010 at 6:32 AM, Shugo Maeda <shugo@ruby-lang.org> wrote:

[#33367] Planning to release 1.8.7 fixes on 12/25 (Japanese timezone) — Urabe Shyouhei <shyouhei@...>

Hello,

20 messages 2010/11/25
[#33439] Re: Planning to release 1.8.7 fixes on 12/25 (Japanese timezone) — Luis Lavena <luislavena@...> 2010/11/27

2010/11/25 Urabe Shyouhei <shyouhei@ruby-lang.org>:

[#33456] [Request for Comment] avoid timer thread — SASADA Koichi <ko1@...>

Hi,

25 messages 2010/11/29
[#35152] Re: [Request for Comment] avoid timer thread — Mark Somerville <mark@...> 2011/02/08

On Mon, Nov 29, 2010 at 11:53:03AM +0900, SASADA Koichi wrote:

[#36077] Re: [Request for Comment] avoid timer thread — Mark Somerville <mark@...> 2011/05/09

On Tue, Feb 08, 2011 at 09:24:13PM +0900, Mark Somerville wrote:

[#36952] Re: [Request for Comment] avoid timer thread — Eric Wong <normalperson@...> 2011/06/10

Mark Somerville <mark@scottishclimbs.com> wrote:

[#37080] Re: [Request for Comment] avoid timer thread — Mark Somerville <mark@...> 2011/06/13

On Sat, Jun 11, 2011 at 05:57:11AM +0900, Eric Wong wrote:

[#37103] Re: [Request for Comment] avoid timer thread — Eric Wong <normalperson@...> 2011/06/13

Mark Somerville <mark@scottishclimbs.com> wrote:

[#37187] Re: [Request for Comment] avoid timer thread — SASADA Koichi <ko1@...> 2011/06/16

(2011/06/14 3:37), Eric Wong wrote:

[#37195] Re: [Request for Comment] avoid timer thread — Eric Wong <normalperson@...> 2011/06/17

SASADA Koichi <ko1@atdot.net> wrote:

[#37205] Re: [Request for Comment] avoid timer thread — Eric Wong <normalperson@...> 2011/06/17

Eric Wong <normalperson@yhbt.net> wrote:

[#33469] [Ruby 1.9-Feature#4100][Open] Improve Net::HTTP documentation — Eric Hodel <redmine@...>

Feature #4100: Improve Net::HTTP documentation

12 messages 2010/11/29

[ruby-core:33037] Re: [Ruby 1.9-Bug#4010] YAML fails to roundtrip non ASCII String

From: Aaron Patterson <aaron@...>
Date: 2010-11-03 16:02:33 UTC
List: ruby-core #33037
On Wed, Nov 03, 2010 at 11:36:35AM +0900, Heesob Park wrote:
> 2010/11/3 Aaron Patterson <aaron@tenderlovemaking.com>:
> > On Tue, Nov 02, 2010 at 09:58:27PM +0900, Heesob Park wrote:
> >> Issue #4010 has been updated by Heesob Park.
> >>
> >>
> >> The same result with psych.
> >>
> >> $ ruby -v -ryaml -e 'YAML::ENGINE.yamler = "psych"; s="한글";pYAML.load(YAML.dump(s))==s'
> >> ruby 1.9.3dev (2010-11-02 trunk 29667) [i686-linux]
> >> /usr/local/lib/ruby/1.9.1/psych/deprecated.rb:79: warning: method redefined; discarding old to_yaml_properties
> >> /usr/local/lib/ruby/1.9.1/syck/rubytypes.rb:13: warning: previous definition of to_yaml_properties was here
> >> false
> >>
> >> FYI the current encoding is 'EUC-KR'.
> >> I know it works when the encoding is 'UTF-8'.
> >
> > I think the problem is that your default_internal encoding isn't being
> > set.  YAML is usually stored as UTF-8, but psych will automatically
> > transcode the string to whatever your default_internal encoding is set
> > to.
> >
> > Maybe this script will help illustrate the problem:
> >
> >
> >    # coding: utf-8
> >
> >    require 'psych'
> >
> >    eucjp = "こんにちは!".encode('EUC-JP')
> >    string = Psych.load(Psych.dump(eucjp))
> >
> >    p string.encoding # => #<Encoding:UTF-8>
> >    p eucjp == string # => false
> >
> >    Encoding.default_internal = 'EUC-JP'
> >
> >    string = Psych.load(Psych.dump(eucjp))
> >    p string.encoding # => #<Encoding:EUC-JP>
> >    p eucjp == string # => true
> >
> > Try running your Ruby like this:
> >
> >  $ ruby -EEUC-KR:EUC-KR -v -ryaml -e 'YAML::ENGINE.yamler = "psych"; s="한글";pYAML.load(YAML.dump(s))==s'
> Here is the result
> 
> $ ruby -EEUC-KR:EUC-KR -v -ryaml -e 'YAML::ENGINE.yamler = "psych";
> s="한글";p YAML.load(YAML.dump(s))==s'
> ruby 1.9.3dev (2010-11-02 trunk 29667) [i686-linux]
> true
> 
> Did you mean it is not a bug and I must specify the default external
> and internal character encodings?

Yes.  YAML is stored as UTF-8 or 16 (possibly 32 as well, depending on
the spec you choose).  Strings you pull out of Psych will have an
encoding that matches the source document.

> $ irb
> irb(main):001:0> Encoding.default_external
> => #<Encoding:EUC-KR>
> irb(main):002:0> Encoding.default_internal
> => nil
> Why ruby cannot detect Encoding.default_internal ?

"default_external" indicates the encoding that files on disk probably
have.  A good default is the OS setting.

magic comments indicate the encoding of string literals within that file.

"default_internal" indicates the encoding that you want strings internal
to your programs to have.  Making that decision is not so easy.  Magic
comments cannot be used because there can exist many magic comments in
your programs.

"default_internal" should be used by things like database adapters (or
in this case YAML parsers) where the encoding of the external entity may
differ from the encoding the user wants to use.  The external entity
should transcode to the user's "default_internal" setting.  Because of
this logic, there exists another problem with setting "default_internal"
to something for the user.

Here is an example of what *could* happen if default_internal was
automatically set:

"default_internal" is automatically set to something, say Shift-JIS, and
you load a YAML file.  You know that your YAML file is stored as UTF-8,
and yet when you output the encoding from your program, you're surprised
to see it report Shift-JIS!  You wanted the original encoding of the file,
but now you must call encode() to get it back to UTF-8.  Even worse,
because you went from UTF-8 => Shift-JIS => UTF-8, now there may be data
loss due to encoding round trip problems.

Hope that helps!

-- 
Aaron Patterson
http://tenderlovemaking.com/

In This Thread