From: naruse@... Date: 2016-02-29T19:56:19+00:00 Subject: [ruby-core:74061] [Ruby trunk Feature#11625][Assigned] Unlock GVL for SHA1 calculations Issue #11625 has been updated by Yui NARUSE. Status changed from Open to Assigned ext/digest should use OpenSSL, which has many optimizations. But old ruby's ext/digest/sha1 was buggy. Through r52694, r52695, and r52755, I fixed it. Now it correctly uses openssl (if you give --with-openssl-dir or something on OS X) % ./ruby -v shaips.rb (ext/digest/sha1 with cdefs) ruby 2.4.0dev (2016-02-29 trunk 53974) [x86_64-darwin15] Warming up -------------------------------------- sha1 6.193k i/100ms Calculating ------------------------------------- sha1 65.138k (��11.9%) i/s - 322.036k % ./ruby shaips.rb (with libressl) Warming up -------------------------------------- sha1 6.930k i/100ms Calculating ------------------------------------- sha1 85.507k (��21.6%) i/s - 388.080k If you want more speed (without FPGA or ASIC), buy Skylake and use latest openssl with optimized build: https://software.intel.com/en-us/articles/improving-openssl-performance http://git.openssl.org/gitweb/?p=openssl.git;a=commitdiff;h=619b94667cc7a097f6d1e2123c4f4c2c85afb8f7 ---------------------------------------- Feature #11625: Unlock GVL for SHA1 calculations https://bugs.ruby-lang.org/issues/11625#change-57213 * Author: Aaron Patterson * Status: Assigned * Priority: Normal * Assignee: ---------------------------------------- I'm trying to calculate many sha1 checksums, but the current sha1 implementation doesn't unlock the GVL, so I can't do it in parallel. I've attached a patch that unlocks the GVL when calculating sha1sums so that I can do them in parallel. The good point about this patch is that I can calculate sha1's in parallel. Here is the test code I'm using: ~~~ require 'digest/sha1' require 'thread' Thread.abort_on_exception = true THREADS = (ENV['THREADS'] || 1).to_i store = 'x' * (ENV['SIZE'] || 1024).to_i queue = Queue.new 600000.times do queue << store end THREADS.times { queue << nil } ts = THREADS.times.map { Thread.new { while work = queue.pop Digest::SHA1.hexdigest(work) end } } ts.each(&:join) ~~~ Here is what the output looks like after I've applied the patch: ~~~ [aaron@TC ruby (trunk)]$ THREADS=1 SIZE=4096 time ./ruby test.rb 22.62 real 21.78 user 0.66 sys [aaron@TC ruby (trunk)]$ THREADS=4 SIZE=4096 time ./ruby test.rb 15.87 real 34.53 user 8.27 sys [aaron@TC ruby (trunk)]$ ~~~ The digests that I'm calculating are for fairly large strings, so this patch works well for me. The downsides are that it seems slightly slower (though I'm not sure that it's significant) with a single thread: Test code: ~~~ require 'benchmark/ips' require 'digest/sha1' Benchmark.ips do |x| x.report('sha1') { Digest::SHA1.hexdigest('x' * 4096) } end ~~~ Before my patch (higher numbers are better): ~~~ [aaron@TC ruby (trunk)]$ ./ruby shaips.rb Calculating ------------------------------------- sha1 2.604k i/100ms ------------------------------------------------- sha1 27.441k (�� 3.9%) i/s - 138.012k ~~~ After my patch: ~~~ [aaron@TC ruby (trunk)]$ ./ruby shaips.rb Calculating ------------------------------------- sha1 2.419k i/100ms ------------------------------------------------- sha1 25.848k (�� 2.8%) i/s - 130.626k ~~~ Other downside is that I changed the `update` method to dup strings so that the GVL can be safely released. This patch pays off for me because of the size of the strings I'm working with, but I'm not sure if it's fine for the general case. ---Files-------------------------------- sha1gvl.diff (3.79 KB) -- https://bugs.ruby-lang.org/ Unsubscribe: