[ruby-talk:8169] Re: speedup of anagram finder

From: David Alan Black <dblack@...>
Date: 2000-12-28 11:37:12 UTC
List: ruby-talk #8169
On Thu, 28 Dec 2000, Joseph McDonald wrote:

> 
> Yup.  Sorry about posting inaccurate benchmarks.
> Here is the whole test:

[...]

>  head -20000 /usr/share/dict/words | ./anagrams2.rb
>                       user     system      total        real
> keys              5.109375   0.062500   5.171875 (  5.226218)
> each_byte         1.835938   0.023438   1.859375 (  1.888366)
> keys              3.367188   0.000000   3.367188 (  3.421602)
> each_byte         1.835938   0.000000   1.835938 (  1.908974)
> keys              3.343750   0.007812   3.351562 (  3.444110)
> each_byte         1.820312   0.015625   1.835938 (  1.846535)        


OK, to make up for my earlier complete waste of electrons (Joe,
you think *you* posted inaccurate results!), here is something
which I really and truly believe speeds things up.  Most
recent joe() first, followed by devel()....

   require "benchmark"

   def joe(words, out = STDOUT)
     anagrams = {}
     keys = {}
     word, key = nil
     total = 0

     words.each_line do |word|         # String has no #each on Ruby 1.9+
       word.chomp!
       word.downcase!                  # bad -- see *** below (DB)
       key = []
       word.each_byte {|s| key.push(s)}
       key.sort!
       if anagrams[key]
         anagrams[key] << word
         keys[key] = 1
       else
         anagrams[key] = [ word ]
       end
     end
     for key in keys.keys
       out.puts anagrams[key].join(' ')
     end
   end

# *** Note that some words may actually contain uppercase
# letters -- so word.downcase! is a bad idea.  (Hence I'd used key =
# word.dup and then operated on key.)

   def devel(words, out = STDOUT)
     anagrams = {}
     keys = {}
     word, key = nil
     total = 0

     words.each_line do |word|     # String has no #each on Ruby 1.9+
       word.chomp!
       key = word.dup
       key.downcase!
       key = key.unpack('c*')      # <= secret weapon :-)
       key.sort!
       if anagrams[key]
         anagrams[key] << word
         keys[key] = 1
       else
         anagrams[key] = [ word ]
       end
     end
     for key in keys.keys
       out.puts anagrams[key].join(' ')
     end
   end

   WORDS = STDIN.read

   Benchmark::bm(16) do |job|
     GC.start
     job.report("joe")    {joe(WORDS, open('/dev/null', 'w'))}
     GC.start
     job.report("devel")    {devel(WORDS, open('/dev/null', 'w'))}
     GC.start
     job.report("joe")    {joe(WORDS, open('/dev/null', 'w'))}
     GC.start
     job.report("devel")    {devel(WORDS, open('/dev/null', 'w'))}
   end

__END__

   % head -10000 /usr/dict/words  | ruby anagrams.rb
                         user     system      total        real
   joe               3.550000   0.050000   3.600000 (  3.599468)
   devel             2.420000   0.010000   2.430000 (  2.430312)
   joe               3.140000   0.000000   3.140000 (  3.145125)
   devel             2.410000   0.000000   2.410000 (  2.406263)


This time around, using 'join' on the key array slowed things down,
so I took it out; the sorted array itself serves as the hash key.
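
To see the key-building step from joe() and devel() in isolation, here is
a minimal sketch. The helper names (key_each_byte, key_unpack,
anagram_groups) and the SAMPLE list are made up for illustration and are
not from the benchmark above. It assumes a current Ruby and plain ASCII
words: unpack('c*') returns signed bytes while each_byte returns unsigned
ones, so the two keys only agree for bytes below 128.

```ruby
# Key built the joe() way: push each byte, then sort.
def key_each_byte(word)
  key = []
  word.downcase.each_byte { |b| key.push(b) }
  key.sort
end

# Key built the devel() way: unpack all bytes at once, then sort.
def key_unpack(word)
  word.downcase.unpack('c*').sort
end

# Group words by key. The original words are never modified, so any
# uppercase letters survive in the output.
def anagram_groups(words)
  groups = Hash.new { |hash, key| hash[key] = [] }
  words.each { |word| groups[key_unpack(word)] << word }
  groups.values.select { |group| group.size > 1 }
end

# Hypothetical sample input (ASCII only).
SAMPLE = %w[Stale least steal tone note Zed]

anagram_groups(SAMPLE).each { |group| puts group.join(' ') }
```

Both helpers work on a downcased copy, which is the same reason devel()
uses word.dup before downcase!.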


David

-- 
David Alan Black
home: dblack@candle.superlink.net
work: blackdav@shu.edu
Web:  http://pirate.shu.edu/~blackdav

