[#390749] Why are there so many similar/identical methods in core classes — Kassym Dorsel <k.dorsel@...>

Let's look at the Array class and start with method aliases.

14 messages 2011/12/02

[#390755] Inverse Operation of Module#include — Su Zhang <su.comp.lang.ruby@...>

Hi list,

21 messages 2011/12/02
[#390759] Re: Inverse Operation of Module#include — Ryan Davis <ryand-ruby@...> 2011/12/02

[#390764] Re: Inverse Operation of Module#include — Isaac Sanders <isaacbfsanders@...> 2011/12/02

I would suggest an Adapter pattern use here. IF there is something that has

[#390795] Is there a better way to check this array? — Wayne Brissette <wbrisett@...>

I have an array which contains items that need to be renamed. Unfortunately the way things are changed are based on a comparison of if something else exists. For example in this simple array:

9 messages 2011/12/03

[#390876] black magical hash element vivification — Chad Perrin <code@...>

Ruby (1.9.3p0 to be precise, installed with RVM) is not behaving as I

12 messages 2011/12/05

[#390918] WEB SURVEY about Ruby Community — Intransition <transfire@...>

Did any one else get this survey request?

14 messages 2011/12/07

[#390972] Which is the best online document conversion site? — "Maria M." <mariamoore952@...>

Can anyone tell me that which is the best online document conversion

7 messages 2011/12/08

[#390976] Confusing results from string multiplication — Rob Marshall <robmarshall@...>

Hi,

19 messages 2011/12/08

[#391019] How can I do h["foo"] += "bar" if h["foo"] does not exist? — "Andrew S." <andrewinfosec@...>

Hi there,

13 messages 2011/12/09

[#391027] reading from file without end-of-lines — Janko Muzykant <umrzykus@...>

hi,

20 messages 2011/12/09
[#391028] Re: reading from file without end-of-lines — Gavin Sinclair <gsinclair@...> 2011/12/09

> i'm trying to read a few text values from single file:

[#391031] Re: reading from file without end-of-lines — Robert Klemme <shortcutter@...> 2011/12/09

On Fri, Dec 9, 2011 at 9:58 AM, Gavin Sinclair <gsinclair@gmail.com> wrote:

[#391042] Re: reading from file without end-of-lines — Gavin Sinclair <gsinclair@...> 2011/12/09

On Fri, Dec 9, 2011 at 8:18 PM, Robert Klemme

[#391135] I need advice on what to do next. — Nathan Kossaeth <system_freak_2004@...>

I am new to programming. I read the ebook "Learn to Program" by Chris

23 messages 2011/12/12

[#391216] perf optimization using profile results — Chuck Remes <cremes.devlist@...>

I need some help with optimizing a set of libraries that I use. They are ffi-rzmq, zmqmachine and rzmq_brokers (all up on github).

13 messages 2011/12/13
[#391218] Re: perf optimization using profile results — Chuck Remes <cremes.devlist@...> 2011/12/13

On Dec 13, 2011, at 9:57 AM, Chuck Remes wrote:

[#391234] Re: perf optimization using profile results — Charles Oliver Nutter <headius@...> 2011/12/14

A couple quick observations.

[#391238] Re: perf optimization using profile results — Chuck Remes <cremes.devlist@...> 2011/12/14

On Dec 13, 2011, at 7:03 PM, Charles Oliver Nutter wrote:

[#391324] ruby 1.9 threading performance goes non-linear — Joel VanderWerf <joelvanderwerf@...>

12 messages 2011/12/16
[#391325] Re: ruby 1.9 threading performance goes non-linear — Eric Wong <normalperson@...> 2011/12/16

Joel VanderWerf <joelvanderwerf@gmail.com> wrote:

[#391420] Accessing class instance variables from an instance? — "Shareef J." <shareef@...>

Hi there,

26 messages 2011/12/20
[#391454] Re: Accessing class instance variables from an instance? — Khat Harr <myphatproxy@...> 2011/12/21

Actually, now that I'm thinking about it the existing behavior sort of

[#391456] Re: Accessing class instance variables from an instance? — Josh Cheek <josh.cheek@...> 2011/12/21

On Tue, Dec 20, 2011 at 9:42 PM, Khat Harr <myphatproxy@hotmail.com> wrote:

[#391545] Kernel#exit raises an exception? — Khat Harr <myphatproxy@...>

While I was working on embedding an interpreter I wrote a function to

13 messages 2011/12/24

[#391618] rvmsh: An easy installer for RVM — Bryan Dunsmore <dunsmoreb@...>

I have recently begun work on a project called [rvmsh]

12 messages 2011/12/29

[#391783] Mailspam — Gunther Diemant <g.diemant@...>

Is there a way to stop this mailspam of Luca (Mail)?

12 messages 2011/12/29

[#391790] What’s the standard way of implementing #hash for value objects in Ruby? — Nikolai Weibull <now@...>

Hi!

23 messages 2011/12/29
[#391792] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Gunther Diemant <g.diemant@...> 2011/12/29

I think you can't access instance variables from a class method, so

[#391793] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Nikolai Weibull <now@...> 2011/12/29

On Thu, Dec 29, 2011 at 15:52, Gunther Diemant <g.diemant@gmx.net> wrote:

[#391811] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Robert Klemme <shortcutter@...> 2011/12/29

On Thu, Dec 29, 2011 at 4:06 PM, Nikolai Weibull <now@bitwi.se> wrote:

[#391812] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Nikolai Weibull <now@...> 2011/12/29

On Fri, Dec 30, 2011 at 00:26, Robert Klemme <shortcutter@googlemail.com> wrote:

[#391816] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Josh Cheek <josh.cheek@...> 2011/12/30

On Thu, Dec 29, 2011 at 5:47 PM, Nikolai Weibull <now@bitwi.se> wrote:

[#391833] Re: What’s the standard way of implementing #hash for value objects in Ruby? — Robert Klemme <shortcutter@...> 2011/12/30

On Fri, Dec 30, 2011 at 12:47 AM, Nikolai Weibull <now@bitwi.se> wrote:

Re: What’s the standard way of implementing #hash for value objects in Ruby?

From: Robert Klemme <shortcutter@...>
Date: 2011-12-31 12:58:24 UTC
List: ruby-talk #391865
On Fri, Dec 30, 2011 at 3:43 PM, Nikolai Weibull <now@bitwi.se> wrote:
> On Fri, Dec 30, 2011 at 15:02, Robert Klemme <shortcutter@googlemail.com> wrote:
>> On Fri, Dec 30, 2011 at 12:47 AM, Nikolai Weibull <now@bitwi.se> wrote:
>
>>> I seem to have failed in communicating what I知 after. 潛 wasn稚 after
>>> different ways of implementing #hash, I was after the golden standard
>>> of #hash implementations for value objects. 澳o, again, what痴 the
>>> standard way of implementing #hash for value objects in Ruby?
>
>> I have covered this a bit earlier:
>> http://blog.rubybestpractices.com/posts/rklemme/018-Complete_Class.html
>
>> Boils down to that the hash code should be derived from hash codes of
>> all fields used as key fields in determining equivalence.
>
> Precisely. 漓nd how do you get a single value from the hash values of
> those fields?
>
>>> One suggestion would be the XOR of the hash values of the object痴
>>> class and its instance variables.
>
>> Not good, because then the same value in different fields has
>> identical influence on the hash code. 澆onsequently likelihood of
>> collisions increases and thus performance of Hash storage and
>> retrieval may decrease.
>
> This is exactly what you suggest in your article, with the addition
> (or rather XOR) of the hash value of the object痴 class.

Right.  I should probably have given more detail in this thread with
regard to what goals this is "worse" than the simple XOR of member
hash values.

>> But still, I don't see the need. 潘ote also that a proper Hash key
>> usually should be immutable because changing them causes all sorts of
>> trouble if not done carefully.
>
> Hence the use of 砺alue objectin my question.

I can't see that "value object" implies "immutable".

http://c2.com/cgi/wiki?ValueObject
http://en.wikipedia.org/wiki/Value_object

>> And most of the time Hash keys are
>> String, Symbol, Fixnum and the like - which all do have their #hash
>> implementation already.
>
> But how do you combine them? 澣hat痴 what this whole thread has been
> about.  realize now that I made quite a few assumptions about what I
> thought readers would understand from my original question that may
> not have been as obvious to them as they were to me.

There you have learned something about communication. :-)

> First, let me apologize for this lack of clarity.

No sweat.

> Second, let me rephrase my
> question and add some additional context and examples:
>
> What algorithm should one employ in the calculation of the hash value
> of an arbitrary value object?

There is no single standard (or best) way.  The fact that different
languages (Java, Ruby...) have different means to calculate combined
hash values which all seem to work pretty well indicates this IMHO.

> As an example, what algorithm should one employ in the calculation of
> the hash value of an immutable class A containing three immutable
> instance variables @a, @b, @c that contain, respectively, a String, a
> Fixnum, and a Symbol and that are all used in the calculation of #==?

In this case I would choose the most straightforward solution: XOR of
all member hash codes.  This is sufficient since you indicated that no
value can show up in two different fields.

> The semi-standard solutions seem to be
>
> class A
> ef hash
> a ^ @b ^ @c
> nd
> end
>
> and
>
> class A
> ef hash
> @a, @b, @c].hash
> nd
> end
>
> These are the two main implementations in the Standard Library,
> anyway. he first is also the solution proposed by Robert Klemme in
>
> http://blog.rubybestpractices.com/posts/rklemme/018-Complete_Class.html

And it does work indeed.  But keep in mind that the article is not a
specific article about calculating hash codes but rather about what
has to be done to make a class "complete".  Hash codes are just one
aspect.

> I would claim that the algorithm should take the class of the object
> into account as well, both for consistency with #== (which should
> check equality of the classes of the objects being compared) and for
> added entropy.

Considering the most common case that all keys in a Hash are of the
same class it is questionable whether this will really add entropy
since then for all instances the class's hash will be the same.  You
pay a price for additional calculation though.

> Internally, Ruby (primarily) uses three C functions for the
> calculation of combined hash values, namely rb_hash_start,
> rb_hash_uint, and rb_hash_end. s an example, the hash value of a
> Struct is calculated (in Ruby with these three functions wrapped in an
> imaginary module C) as
>
> class Struct
> ef hash
> .rb_hash_end(reduce(C.rb_hash_start(self.class.hash)){ |h, v|
> C.rb_hash_uint(h, v.hash) })
> nd
> end
>
> Might it be useful to have Ruby expose a way to perform this
> calculation from the Ruby realm so that other classes may employ this
> algorithm?

Not sure whether we would really gain that much.  Those calls are
efficient in C but if you provide that mechanism in Ruby land you will
have multiple calls, e.g.

def hash
  h = Fixnum::HASH_START
  h = h.combine_hash(@a)
  h = h.combine_hash(@b)
  h = h.combine_hash(@c)
end

It may turn out to be more efficient to just do

def hash
  [@a, @b, @c].hash
end

or simply use instances of Struct generated classes and reuse their
implementation.

Requirements for a hash function are pretty clear (and pretty much standard):

1. It should make it as unlikely as possible that two objects which
are not equivalent have the same hash value.
2. Calculation should be as cheap (in terms of CPU cycles) as possible.

http://en.wikipedia.org/wiki/Hash_function#Properties has more detail

As you can see these are no hard numeric or formal requirements and
they both have potential to contradict each other.  So it depends on
the situation what algorithm one chooses.  It is always a trade off
between these two goals and there is no single solution which is best
under all circumstances.

I have attached a stats generator for existing Hashes.  So anybody
interested can play with it and find out the distribution of hash
values for a given Hash.

Kind regards

robert

-- 
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/

Attachments (1)

hash_stat.rb (287 Bytes, application/x-ruby)

In This Thread