[#19064] Fwd: [ruby-dev:36523] Re: Encoding.default_internal — Martin Duerst <duerst@...>
There has been some disconnect lately between ruby-dev and ruby-core
On Oct 1, 2008, at 5:09 AM, Martin Duerst wrote:
On Wed, Oct 1, 2008 at 9:46 AM, James Gray <james@grayproductions.net> wrote:
[#19075] Request For Removal: No Operator Concatenation — James Gray <james@...>
I'm disappointed that Ruby still supports this goofy syntax:
On Wed, Oct 1, 2008 at 1:58 PM, James Gray <james@grayproductions.net> wrote:
On Wed, Oct 1, 2008 at 1:08 PM, Gregory Brown <gregory.t.brown@gmail.com> wrote:
On Oct 1, 2008, at 1:15 PM, Jim Freeze wrote:
On Wed, Oct 1, 2008 at 1:29 PM, James Gray <james@grayproductions.net> wrote:
On Oct 1, 2008, at 1:37 PM, Jim Freeze wrote:
On Oct 1, 2008, at 11:42 AM, James Gray wrote:
On Wed, Oct 1, 2008 at 2:45 PM, Eric Hodel <drbrain@segment7.net> wrote:
On Wed, Oct 1, 2008 at 2:10 PM, Gregory Brown <gregory.t.brown@gmail.com> wrote:
On Oct 1, 2008, at 2:17 PM, Jim Freeze wrote:
On Wed, Oct 1, 2008 at 2:25 PM, James Gray <james@grayproductions.net> wrote:
On Oct 1, 2008, at 12:30 PM, Jim Freeze wrote:
Hi,
On Oct 1, 2008, at 10:33 PM, Yusuke ENDOH wrote:
[#19127] Autoload and class definition — Tomas Matousek <Tomas.Matousek@...>
I've found an interesting corner case of autoload behavior, which I think is wrong.
[#19132] [Feature #615] "with" operator — Lavir the Whiolet <redmine@...>
Feature #615: "with" operator
Hi,
On Mon, Oct 06, 2008 at 10:46:49AM +0900, Nobuyoshi Nakada wrote:
On Mon, Oct 06, 2008 at 10:56:23PM +0900, Paul Brannan wrote:
On Mon, Oct 6, 2008 at 3:34 PM, Trans <transfire@gmail.com> wrote:
Hi --
On Tue, Oct 07, 2008 at 05:47:23AM +0900, David A. Black wrote:
Hi --
[#19168] [Bug:1.9] rubygems depend on test/unit/ui/console/testrunner — "Yusuke ENDOH" <mame@...>
Hi,
On Oct 7, 2008, at 07:43 AM, Yusuke ENDOH wrote:
Eric Hodel wrote:
[#19225] Module.freeze vs Object.freeze — Curt Hagenlocher <curth@...>
What's the difference between Module.freeze and Object.freeze? They seem to have the same effect.
[#19242] Regexp Order Matters in 1.9 — James Gray <james@...>
I'm just curious, why does this work:
[#19250] default_internal encoding — Dave Thomas <dave@...>
I'm documenting default_internal for the PickAxe, and have a couple of
Hi,
On Oct 9, 2008, at 6:06 PM, Michael Selig wrote:
On Fri, 10 Oct 2008 13:09:31 +1100, James Gray <james@grayproductions.net>
On Wed, Oct 8, 2008 at 3:52 PM, Paul Brannan <pbrannan / atdesk.com> wrote:
On Fri, Oct 10, 2008 at 10:30:31AM +0900, Michael Selig wrote:
Paul Brannan wrote:
Charles Oliver Nutter wrote:
[#19294] [Bug #634] Time parsing works in 1.8 but not 1.9 — Aaron Patterson <redmine@...>
Bug #634: Time parsing works in 1.8 but not 1.9
Issue #634 has been updated by tadayoshi funaba.
[#19298] [Feature #639] New String#encode_internal method — Michael Selig <redmine@...>
Feature #639: New String#encode_internal method
Hi,
[#19304] 1.9, encoding & win32 wide char support — Lloyd Hilaiel <lloyd@...>
hello,
[#19315] [Feature #643] __DIR__ — Thomas Sawyer <redmine@...>
Feature #643: __DIR__
[#19332] Can I confirm a change to source file encoding — Dave Thomas <dave@...>
A month ago, if I had
[#19342] [Bug #649] Memory leak in a array assignment? — Henri Suur-Inkeroinen <redmine@...>
Bug #649: Memory leak in a array assignment?
On Tue, Feb 3, 2009 at 8:44 PM, Brent Roman <brent@mbari.org> wrote:
[#19343] Yet another block semantic/syntax question — "David A. Black" <dblack@...>
Hi --
[#19350] Net::HTTP.post_form bug : can't post form to correct uri which contains QueryString(QueryString part are lost) and revise — Klesh <kleshwong@...>
Hi,
You are trying to use GET-style query params instead of POSTing the
Dear Matt
From my experience, it's simply easier to process requests that way,
Thanks,
2008/10/17 Matt Todd <chiology@gmail.com>:
On Oct 19, 2008, at 8:55 AM, mathew wrote:
[#19373] capture_io in minitest — Tanaka Akira <akr@...>
capture_io changes $stdout.fileno.
[#19378] Constant names in 1.9 — Dave Thomas <dave@...>
When Ruby makes the tIDENTIFIER/tCONSTANT test, it looks to see if the
Hi,
On Oct 18, 2008, at 8:32 AM, Yukihiro Matsumoto wrote:
Hi,
[#19385] [Bug #657] Thread.new { fork } — "James M. Lawrence" <redmine@...>
Bug #657: Thread.new { fork }
[#19388] [Bug #663] Benchmark.measure outputs different result when executed using command line "ruby -e ..." — Artem Vorozhtsov <redmine@...>
Bug #663: Benchmark.measure outputs different result when executed using command line "ruby -e ..."
[#19397] [Feature #666] Enumerable::to_hash — Marc-Andre Lafortune <redmine@...>
Feature #666: Enumerable::to_hash
Issue #666 has been updated by Yukihiro Matsumoto.
Hi,
Thank you for this explanation. If I understand correctly, you want methods
Hi,
Thank you for your response
On Wed, 22 Apr 2009 05:45:06 +0900
[#19410] rb_errinfo() vs rb_rubylevel_errinfo() — Paul Brannan <pbrannan@...>
What is the difference between these two functions?
Hi,
[#19413] Is this expected, or should I report it? — Dave Thomas <dave@...>
Given
[#19422] Now that lambda has more powerful arguments... — Dave Thomas <dave@...>
is there anything that
Dave Thomas schrieb:
On Wed, Oct 22, 2008 at 04:01:45AM +0900, Dave Thomas wrote:
Hi --
On Wed, Oct 22, 2008 at 04:38:19AM +0900, David A. Black wrote:
Hi --
On Oct 21, 2008, at 4:24 PM, David A. Black wrote:
Hi --
[#19446] confused by this catch table — Paul Brannan <pbrannan@...>
irb(main):001:0> require 'internal/proc'
[#19458] Should Method@instance_methods reveal protected methods? — Dave Thomas <dave@...>
The RDoc says it just returns public methods, but
[#19465] [Bug #680] csv.rb: CSV.parse is too late when encoding is mismatch — Takeyuki Fujioka <redmine@...>
Bug #680: csv.rb: CSV.parse is too late when encoding is mismatch
Hi,
A default for the source encoding has been discussed quite a long
On Oct 24, 2008, at 1:52 AM, Martin Duerst wrote:
On Oct 24, 2008, at 8:06 AM, James Gray wrote:
On Sat, 25 Oct 2008 00:07:13 +1100, James Gray <james@grayproductions.net>
On Oct 26, 2008, at 6:48 PM, Michael Selig wrote:
Hi,
Hi,
Hi,
Hi,
On Sun, 26 Oct 2008 17:26:32 +1100, Nobuyoshi Nakada <nobu@ruby-lang.org>
Hi,
On Sun, 26 Oct 2008 23:34:26 +1100, Nobuyoshi Nakada <nobu@ruby-lang.org>
Hi,
On Mon, 27 Oct 2008 16:07:54 +1100, Nobuyoshi Nakada <nobu@ruby-lang.org>
Hi,
On Mon, 27 Oct 2008 17:27:57 +1100, Nobuyoshi Nakada <nobu@ruby-lang.org>
Hi,
On Mon, 27 Oct 2008 20:55:32 +1100, Nobuyoshi Nakada <nobu@ruby-lang.org>
Hi,
On Oct 27, 2008, at 7:07 AM, Nobuyoshi Nakada wrote:
Hi,
[#19468] [Bug:1.9] failures of test/minitest — Nobuyoshi Nakada <nobu@...>
Hi,
[#19478] Ruby 1.8.7 Throwing "Too many open files" Exception lately??? — "C.E. Thornton" <admin@...>
Group,
[#19487] [ANN] Sipper 1.1.3 Released — "Nasir Khan" <rubylearner@...>
1.1.3 of SIPr pronounced as Sipper has been released earlier this month.
[#19504] Is the stabby proc gone? broken? — "David A. Black" <dblack@...>
Hi --
[#19523] Too Many Files Error -- Test Case Produced. — "C.E. Thornton" <admin@...>
Core,
[#19555] Managing 1.9 threads in extensions — Dave Thomas <dave@...>
I'm trying to pin down the rules for folks who write extensions for
[#19561] Was there a feature freeze on October 25th? — Dave Thomas <dave@...>
Curious authors want to know... :)
[#19564] Ruby 1.9.1 preview1 is out — "Yugui (Yuki Sonoda)" <yugui@...>
Hi all,
[#19566] GC thought — "Roger Pack" <roger.pack@...>
Here is a recent patch I've been experimenting with--for any advice. [1]
On Tue, 28 Oct 2008 17:02:17 +0900, Roger Pack wrote:
> Letting the program continue execution during the mark phase could cause
On Wed, Oct 29, 2008 at 01:04:52AM +0900, Roger Pack wrote:
2008/10/28 Paul Brannan <pbrannan@atdesk.com>:
Robert Klemme wrote:
On Wed, Oct 29, 2008 at 4:02 AM, Hongli Lai <hongli@plan99.net> wrote:
[#19578] [Bug #691] Time::zone_utc? does not follow rfc2822 — Chun Wang <redmine@...>
Bug #691: Time::zone_utc? does not follow rfc2822
[#19583] [Bug #694] eof? call on a pty IO object causes application to exit — Dave Thomas <redmine@...>
Bug #694: eof? call on a pty IO object causes application to exit
[#19590] [Feature #695] More flexibility when combining ASCII-8BIT strings with other encodings — Michael Selig <redmine@...>
Feature #695: More flexibility when combining ASCII-8BIT strings with other encodings
Hi,
At 07:14 08/10/31, Michael Selig wrote:
Hi
[#19599] Future of Continuations — "r. schempp" <ruben.schempp@...>
Hi,
r. schempp schrieb:
r. schempp schrieb:
[#19604] test failure in r20022 — Mike Stok <mike@...>
I noticed this failure in my morning build of ruby trunk on my laptop:
[#19610] [Bug 1.9] gem_prelude.rb always require rubygems — Yukihiro Matsumoto <matz@...>
Hi,
[#19618] Result of backticks — Jim Deville <jdeville@...>
`echo disc world` returns "disc world\n"
[#19634] performance issues with --enable-pthread on Solaris. — Paul van den Bogaard <Paul.Vandenbogaard@...>
Introduction
[#19660] Odd TypeError in inject (1.9.1 preview 1) — "David A. Black" <dblack@...>
Hi --
On Fri, Oct 31, 2008 at 5:20 AM, David A. Black <dblack@rubypal.com> wrote:
Hi,
On Fri, Oct 31, 2008 at 8:40 AM, Nobuyoshi Nakada <nobu@ruby-lang.org>wrote:
[#19668] [Bug #703] string output duplication occurs if the same file descriptor written to in different threads — Roger Pack <redmine@...>
Bug #703: string output duplication occurs if the same file descriptor written to in different threads
Hi,
[ruby-core:19515] String literal encoding (Was: Default source encoding (Was: [Bug #680] csv.rb: CSV.parse is toolate when encoding is mismatch))
Hi, Sorry, perhaps I have been giving a (bad) solution, rather than stating the problem clearly, so let me try again! I certainly didn't mean to suggest there should be any transcoding of string literals by Ruby's parser. So here are the problems as I see them. They are all to do with the default encoding of string literals, and they are all fairly minor, but I think addressing them has merit: 1) The encoding of string literals constructed with "\x..." is ambiguous. Well not strictly ambiguous, but certainly it can be confusing. The trouble is that a string literal like the example in bug #680 "\x82\xA0,\x82\xA2" can either be used as a "binary" string (ASCII-8BIT) or an encoded character string (intended to be Shift_JIS in this case), but this depends on the source encoding. While technically these are the same data, they are used in quite different ways in practice. Also, as we see in the bug report, it can cause mysterious errors such as "Bad UTF-8 string" because the source encoding was apparently UTF-8 not Shift_JIS (thank you to Martin for pointing this out). Ruby treats strings constucted with "\u..." differently: they are set to UTF-8 no matter what the source encoding. I think this is the correct behaviour - there is no ambiguity. But "\x..." is not treated like this. When the source encoding is not specified (or is US-ASCII), a "\x.." string is set to ASCII-8BIT. Again I think this is the correct behaviour. However if the source encoding is set to anything else, the encoding of the string is set to the source encoding. I think this is the part that is wrong, especially as the resultant string can be "broken", and no warning is given about this by the parser. My preference would be to *always* encode string literals constructed with "\x.." as ASCII-8BIT, ignoring the source encoding. This means that if you really want to use such a literal as an encoded string, you must use "force_encoding". I think this would be much clearer and get rid of the "ambiguity". 2) I find it slightly redundant to have to specify BOTH the default_internal, and the source encoding at the top of an m17n script which contains multibyte string literals, when in all practical cases they should be the same. eg: #! /usr/bin/ruby -E:UTF-8 # encoding: UTF-8 My suggestion for "defaulting" the source encoding was an attempt to avoid having to do this (but probably not a good way!). It isn't a big deal, and I understand the argument that the source encoding is a property of the script. My original suggestion (last month) of a special magic comment was to have a way of specifying BOTH the default_internal and source encoding once, but this idea was rejected. 3) I think there should be some check (warning message?) that the (non ASCII-8BIT) string literals in a library file are compatible with the "default_internal" of the calling program (if it is set). Ideally this check would be done when the "require" is called to flag possible incompatibilities early. Perhaps this check could be based on the library's source encoding? If this were done, most libraries would have to use a source encoding of US-ASCII (or just have no encoding magic comment) *not* UTF-8, so that non-Unicode default_internal's will work. Perhaps Ruby could be smarter, and only flag an error if there actually is an incomaptible string literal in the library? 4) I was surprised at the different source encoding behaviour when using "-e" compared to a script in a file. (Again thank you to Martin for telling me about it) Matz wrote: > -e takes programs from command line shell, which probably yields > strings in locale encoding anyway. But we cannot assume that for > scripts contained in files. Again I understand the sentiment, but for a simple non-m17n, non-ascii ruby script that was likely written with an editor on the same machine or in the same locale, why force it to have an "encoding" magic comment? Also it means that: ruby test.rb may perform differently than: ruby -e "`cat test.rb`" Again potentially confusing, but not a big deal. I hope I have made myself clearer this time! Thanks, Mike.