[ruby-core:118711] [Ruby master Misc#20652] Memory allocation for gsub has increased from Ruby 2.7 to 3.3
From: "ko1 (Koichi Sasada) via ruby-core" <ruby-core@...>
Date: 2024-07-27 22:04:41 UTC
List: ruby-core #118711
Issue #20652 has been updated by ko1 (Koichi Sasada).
Eregon (Benoit Daloze) wrote in #note-14:
> ko1 (Koichi Sasada) wrote in #note-11:
> > I came up with an idea: each thread points to an unescaped MatchData, rather than `$~`, and reuses it.
>
> I think that's too incompatible because `$~` is frame-local and thread-local, so we need multiple `$~` per thread, as @byroot showed.
No. This is not user-visible behavior, so there is no incompatibility.
(It does not change `$~`.)
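
For context, the frame-local and thread-local behavior of `$~` referred to in the quoted note can be observed with a small sketch like the one below. The method name `inner_match` and the sample strings are made up for illustration and are not from the issue.

```ruby
# Hypothetical helper: performs its own match in a separate method frame.
def inner_match
  "xyz" =~ /y/
  $~[0] # this frame's last match
end

"abc" =~ /b/
inner = inner_match
outer = $~[0] # the caller's $~ is not overwritten by the callee's match

# A match made inside another thread does not affect this thread's $~ either.
in_thread = Thread.new { "xyz" =~ /z/; $~[0] }.value
after_thread = $~[0]

puts inner, outer, in_thread, after_thread
```

This only shows the behavior visible from Ruby code. Whether an internal, non-escaping MatchData can be reused without changing any of it is the question under discussion.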
----------------------------------------
Misc #20652: Memory allocation for gsub has increased from Ruby 2.7 to 3.3
https://bugs.ruby-lang.org/issues/20652#change-109246
* Author: orisano (Nao Yonashiro)
* Status: Open
* Assignee: jeremyevans0 (Jeremy Evans)
----------------------------------------
I recently upgraded from ruby 2.7.7 to 3.3.1 and noticed that the GC load increased.
When I used the allocation profiler to investigate, I found that memory allocation from gsub had increased.
The problem was code like this:
```ruby
s = "foo "
s.gsub(/ (\s+)/) { " #{' ' * Regexp.last_match(1).length}" }
```
When I compared the heap-profiler results for 2.7.7 and 3.3.1, I found that the number of MatchData objects had increased.
https://gist.github.com/orisano/98792dee260106e9b6fcb45bbabeb1e6
https://github.com/ruby/ruby/commit/abc0304cb28cb9dcc3476993bc487884c139fd11
I discovered that the cause is this commit, which stopped reusing backref to avoid race conditions.
Is there a way to reuse backref while still avoiding race conditions?
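
A rough way to observe the difference without a profiler is to count MatchData slots before and after the call with GC disabled. This is only a sketch: the helper name `match_data_allocated` and the sample input are assumptions, not part of the original report, and the exact count depends on the Ruby version.

```ruby
# Hypothetical helper (not from the issue): returns how many MatchData
# objects were created while the block ran. GC is disabled during the
# measurement so that no T_MATCH slot is freed in between.
def match_data_allocated
  GC.start
  was_disabled = GC.disable # true if GC was already disabled
  before = ObjectSpace.count_objects[:T_MATCH].to_i
  yield
  ObjectSpace.count_objects[:T_MATCH].to_i - before
ensure
  GC.enable unless was_disabled
end

# Assumed sample input with several runs of spaces, so the pattern matches
# more than once.
input = "a  b   c    d"
result = nil

count = match_data_allocated do
  result = input.gsub(/ (\s+)/) { " #{' ' * Regexp.last_match(1).length}" }
end

puts "MatchData objects allocated: #{count}"
```

Running the same script under both Ruby versions should make the change in per-match allocation visible; on a Ruby that reuses the backref, the count is expected to stay small regardless of how many matches occur.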
--
https://bugs.ruby-lang.org/