[#14696] Inconsistency in rescuability of "return" — Charles Oliver Nutter <charles.nutter@...>

Why can you not rescue return, break, etc when they are within

21 messages 2008/01/02
[#14699] Re: Inconsistency in rescuability of "return" — Gary Wright <gwtmp01@...> 2008/01/02

[#14738] Enumerable#zip Needs Love — James Gray <james@...>

The community has been building a Ruby 1.9 compatibility tip list on

15 messages 2008/01/03
[#14755] Re: Enumerable#zip Needs Love — Martin Duerst <duerst@...> 2008/01/04

Hello James,

[#14772] Manual Memory Management — Pramukta Kumar <prak@...>

I was thinking it would be nice to be able to free large objects at

36 messages 2008/01/04
[#14788] Re: Manual Memory Management — Marcin Raczkowski <mailing.mr@...> 2008/01/05

I would only like to add that RMgick for example provides free method to

[#14824] Re: Manual Memory Management — MenTaLguY <mental@...> 2008/01/07

On Sat, 5 Jan 2008 15:49:30 +0900, Marcin Raczkowski <mailing.mr@gmail.com> wrote:

[#14825] Re: Manual Memory Management — "Evan Weaver" <evan@...> 2008/01/07

Python supports 'del reference', which decrements the reference

[#14838] Re: Manual Memory Management — Marcin Raczkowski <mailing.mr@...> 2008/01/08

Evan Weaver wrote:

[#14911] Draft of some pages about encoding in Ruby 1.9 — Dave Thomas <dave@...>

Folks:

24 messages 2008/01/10

[#14976] nil encoding as synonym for binary encoding — David Flanagan <david@...>

The following just appeared in the ChangeLog

37 messages 2008/01/11
[#14977] Re: nil encoding as synonym for binary encoding — Yukihiro Matsumoto <matz@...> 2008/01/11

Hi,

[#14978] Re: nil encoding as synonym for binary encoding — Dave Thomas <dave@...> 2008/01/11

[#14979] Re: nil encoding as synonym for binary encoding — David Flanagan <david@...> 2008/01/11

Dave Thomas wrote:

[#14993] Re: nil encoding as synonym for binary encoding — Dave Thomas <dave@...> 2008/01/11

[#14980] Re: nil encoding as synonym for binary encoding — Gary Wright <gwtmp01@...> 2008/01/11

[#14981] Re: nil encoding as synonym for binary encoding — Yukihiro Matsumoto <matz@...> 2008/01/11

Hi,

[#14995] Re: nil encoding as synonym for binary encoding — David Flanagan <david@...> 2008/01/11

Yukihiro Matsumoto writes:

[#15050] how to "borrow" the RDoc::RubyParser and HTMLGenerator — Phlip <phlip2005@...>

Core Rubies:

17 messages 2008/01/13
[#15060] Re: how to "borrow" the RDoc::RubyParser and HTMLGenerator — Eric Hodel <drbrain@...7.net> 2008/01/14

On Jan 13, 2008, at 08:54 AM, Phlip wrote:

[#15062] Re: how to "borrow" the RDoc::RubyParser and HTMLGenerator — Phlip <phlip2005@...> 2008/01/14

Eric Hodel wrote:

[#15073] Re: how to "borrow" the RDoc::RubyParser and HTMLGenerator — Eric Hodel <drbrain@...7.net> 2008/01/14

On Jan 13, 2008, at 20:35 PM, Phlip wrote:

[#15185] Friendlier methods to compare two Time objects — "Jim Cropcho" <jim.cropcho@...>

Hello,

10 messages 2008/01/22

[#15194] Can large scale projects be successful implemented around a dynamic programming language? — Jordi <mumismo@...>

A good article I have found (may have been linked by slashdot, don't know)

8 messages 2008/01/24

[#15248] Symbol#empty? ? — "David A. Black" <dblack@...>

Hi --

24 messages 2008/01/28
[#15250] Re: Symbol#empty? ? — Yukihiro Matsumoto <matz@...> 2008/01/28

Hi,

Comment for future documentation of String#[] (etc.)

From: Wolfgang Nádasi-Donner <ed.odanow@...>
Date: 2008-01-05 19:22:34 UTC
List: ruby-core #14796
Hi!

I tested some minutes ago the access to individual characters
in a string object. I did this with an ascii and an utf-8 encoded
string. Here is the variant that adds one character to a pure
ascii string, in one case a "x", in the other a "竄ャ", but did
access the character before this one...

require 'benchmark'
include Benchmark

utf8 =     'abcdefgh' * 1000000 + '竄ャ'
ascii =    'abcdefgh' * 1000000 + 'x'

puts "Zeichen in 'utf8':          #{utf8.length}"
puts "Bytes in 'utf8':            #{utf8.bytesize}"
puts "Encoding von 'utf8':        #{utf8.encoding}"
puts "Zeichen 8000001 in 'utf8':  #{utf8[8000000]}"
puts ""
puts "Zeichen in 'ascii':         #{ascii.length}"
puts "Bytes in 'ascii':           #{ascii.bytesize}"
puts "Encoding von 'ascii':       #{ascii.encoding}"
puts "Zeichen 8000001 in 'ascii': #{ascii[8000000]}"
puts ""

bm(10) do |t|
   t.report('utf8'){100.times{utf8[7999999]}}
   t.report('ascii'){100.times{ascii[7999999]}}
end

...which gave the result (Ruby 1.9.0-0, MinGW/MSYS, Windows2000)...

Zeichen in 'utf8':          8000001
Bytes in 'utf8':            8000003
Encoding von 'utf8':        UTF-8
Zeichen 8000001 in 'utf8':  竄ャ

Zeichen in 'ascii':         8000001
Bytes in 'ascii':           8000001
Encoding von 'ascii':       ASCII-8BIT
Zeichen 8000001 in 'ascii': x

                 user     system      total        real
utf8      117.229000   0.030000 117.259000 (140.752000)
ascii       0.000000   0.000000   0.000000 (  0.000000)

The reason is clear, but there should be a remark in the future
documentation for the methods of class String, that will access
individual characters, that these methods can be very inefficient
when the string is utf-8 encoded and contains somewhere a character
that needs more than one byte.

Wolfgang Nテ。dasi-Donner

In This Thread

Prev Next