[ruby-core:123639] [Ruby Bug#21559] Unicode normalization nfd -> nfc -> nfd is not reversible
From:
duerst via ruby-core <ruby-core@...>
Date:
2025-11-02 02:06:45 UTC
List:
ruby-core #123639
Issue #21559 has been updated by duerst (Martin D=FCrst).
Status changed from Open to Closed
Added regression test at https://github.com/ruby/ruby/commit/a122d7a58e91ed=
6cd531e906cb398688d7cc8b17
and fix at https://github.com/ruby/ruby/commit/e4c8e3544237b8c0efba6b945173=
dc66552d641c.
Many thanks to Tomoya Ishida for finding this bug.
----------------------------------------
Bug #21559: Unicode normalization nfd -> nfc -> nfd is not reversible
https://bugs.ruby-lang.org/issues/21559#change-115023
* Author: tompng (tomoya ishida)
* Status: Closed
* Assignee: duerst (Martin D=FCrst)
* Backport: 3.2: UNKNOWN, 3.3: UNKNOWN, 3.4: UNKNOWN
----------------------------------------
I expect `nfd(nfc(str)) =3D=3D nfd(str)` but found a string that doesn't.
~~~ruby
# Ruby 3.1 - 3.5
str =3D "s\u{11930}\u{323}\u{11930}\u{307}"
p str.unicode_normalize(:nfd) =3D=3D str.unicode_normalize(:nfc).unicode_no=
rmalize(:nfd)
#=3D> false
~~~
~~~ruby
# ruby 3.5.0dev
str =3D "s\u{1611e}\u{323}\u{1611e}\u{307}\u{1611f}"
p str.unicode_normalize(:nfd) =3D=3D str.unicode_normalize(:nfc).unicode_no=
rmalize(:nfd)
#=3D> false
~~~
--=20
https://bugs.ruby-lang.org/
______________________________________________
ruby-core mailing list -- ruby-core@ml.ruby-lang.org
To unsubscribe send an email to ruby-core-leave@ml.ruby-lang.org
ruby-core info -- https://ml.ruby-lang.org/mailman3/lists/ruby-core.ml.rub=
y-lang.org/