[#15359] Timeout::Error — Jeremy Thurgood <jerith@...>

Good day,

41 messages 2008/02/05
[#15366] Re: Timeout::Error — Eric Hodel <drbrain@...7.net> 2008/02/06

On Feb 5, 2008, at 06:20 AM, Jeremy Thurgood wrote:

[#15370] Re: Timeout::Error — Jeremy Thurgood <jerith@...> 2008/02/06

Eric Hodel wrote:

[#15373] Re: Timeout::Error — Nobuyoshi Nakada <nobu@...> 2008/02/06

Hi,

[#15374] Re: Timeout::Error — Jeremy Thurgood <jerith@...> 2008/02/06

Nobuyoshi Nakada wrote:

[#15412] Re: Timeout::Error — Nobuyoshi Nakada <nobu@...> 2008/02/07

Hi,

[#15413] Re: Timeout::Error — Jeremy Thurgood <jerith@...> 2008/02/07

Nobuyoshi Nakada wrote:

[#15414] Re: Timeout::Error — Nobuyoshi Nakada <nobu@...> 2008/02/07

Hi,

[#15360] reopen: can't change access mode from "w+" to "w"? — Sam Ruby <rubys@...>

I ran 'rake test' on test/spec [1], using

16 messages 2008/02/05
[#15369] Re: reopen: can't change access mode from "w+" to "w"? — Nobuyoshi Nakada <nobu@...> 2008/02/06

Hi,

[#15389] STDIN encoding differs from default source file encoding — Dave Thomas <dave@...>

This seems strange:

21 messages 2008/02/06
[#15392] Re: STDIN encoding differs from default source file encoding — Yukihiro Matsumoto <matz@...> 2008/02/06

Hi,

[#15481] very bad character performance on ruby1.9 — "Eric Mahurin" <eric.mahurin@...>

I'd like to bring up the issue of how characters are represented in

16 messages 2008/02/10

[#15528] Test::Unit maintainer — Kouhei Sutou <kou@...>

Hi Nathaniel, Ryan,

22 messages 2008/02/13

[#15551] Proc#curry — ts <decoux@...>

21 messages 2008/02/14
[#15557] Re: [1.9] Proc#curry — David Flanagan <david@...> 2008/02/15

ts wrote:

[#15558] Re: [1.9] Proc#curry — Yukihiro Matsumoto <matz@...> 2008/02/15

Hi,

[#15560] Re: Proc#curry — Trans <transfire@...> 2008/02/15

[#15585] Ruby M17N meeting summary — Martin Duerst <duerst@...>

This is a rough translation of the Japanese meeting summary

19 messages 2008/02/18

[#15596] possible bug in regexp lexing — Ryan Davis <ryand-ruby@...>

current:

17 messages 2008/02/19

[#15678] Re: [ANN] MacRuby — "Rick DeNatale" <rick.denatale@...>

On 2/27/08, Laurent Sansonetti <laurent.sansonetti@gmail.com> wrote:

18 messages 2008/02/28
[#15679] Re: [ANN] MacRuby — "Laurent Sansonetti" <laurent.sansonetti@...> 2008/02/28

On Thu, Feb 28, 2008 at 6:33 AM, Rick DeNatale <rick.denatale@gmail.com> wrote:

[#15680] Re: [ANN] MacRuby — Yukihiro Matsumoto <matz@...> 2008/02/28

Hi,

[#15683] Re: [ANN] MacRuby — "Laurent Sansonetti" <laurent.sansonetti@...> 2008/02/28

On Thu, Feb 28, 2008 at 1:51 PM, Yukihiro Matsumoto <matz@ruby-lang.org> wrote:

Re: very bad character performance on ruby1.9

From: "Eric Mahurin" <eric.mahurin@...>
Date: 2008-02-17 15:11:24 UTC
List: ruby-core #15580
On Feb 16, 2008 8:51 PM, Charles Oliver Nutter <charles.nutter@sun.com>
wrote:

> Eric Mahurin wrote:
> > I'd like to bring up the issue of how characters are represented in ruby
> > 1.9 from a performance standpoint.  In a recent ruby-quiz (parsing
> > JSON), the fastest pure-ruby solution was simply an LL(1) parser that
> > looked at one character at a time (it beat various Regexp solutions).
> > With ruby 1.9, the runtime increased by 4X making it a slow solution.  A
> > simple benchmark is at the end of this message that counts words in an
> > LL(1) fashion.  With ruby 1.8.6, it can could the words in Homer's Iliad
> > in 1.46s on my machine and in ruby 1.9 (from ubuntu gutsy) it takes
> > 52.87s (36X increase in runtime).
>
> Interesting benchmark; may I include it in JRuby's collection of
> benchmarks?
>
> - Charlie
>

Of course!  It is basically just a test of IO#getc or StringIO#getc along
with comparison of those characters with some fixed characters.  I think it
is a good representation of LL(1) character parsing/lexing.  LR/LALR/DFA/NFA
character parsing/lexing will have similar operations.  Instead of recursive
code, you'll have a state machine/stack for those.  Also, for any of these,
table lookup could be done instead of comparison, but it would cost a bit of
memory for character sets (probably slower too).

Most people coding in ruby use Regexp (DFA or NFA parsing itself?) at the
character-level, so the above isn't necessarily representative.  I just find
using Regexp with an IO more trouble than it is worth when writing a clean
parser/lexer (generator).

Eric

In This Thread

Prev Next