[#109207] [Ruby master Feature#18915] New error class: NotImplementedYetError or scope change for NotImplementedYet — Quintasan <noreply@...>
Issue #18915 has been reported by Quintasan (Michał Zając).
18 messages
2022/07/14
[ruby-core:109161] [Ruby master Bug#18898] IO#set_encoding with invalid arguments leads to a segfault
From:
"javanthropus (Jeremy Bopp)" <noreply@...>
Date:
2022-07-07 14:13:54 UTC
List:
ruby-core #109161
Issue #18898 has been updated by javanthropus (Jeremy Bopp).
It looks like the internal encoding should be set to `Encoding.default_internal` when an invalid encoding name is given as the second argument. The equivalent is giving `nil` as the second argument instead as far as I can tell. This script demonstrates the difference in behavior:
```ruby
def show(io)
printf(
"external encoding: %-25p internal encoding: %-25p\n",
io.external_encoding,
io.internal_encoding
)
end
Encoding.default_external = 'iso-8859-1'
Encoding.default_internal = 'iso-8859-2'
File.open('/dev/null') do |f|
f.set_encoding('utf-8', nil)
show(f)
f.set_encoding('utf-8', 'invalid')
show(f)
Encoding.default_external = 'iso-8859-3'
Encoding.default_internal = 'iso-8859-4'
show(f)
end
```
In the 1st case, we see that the IO's internal encoding is set to the current setting of `Encoding.default_internal`. In the 2nd case, the IO's internal encoding is set to `Encoding.default_external` instead. The 3rd case more interesting because it shows that the IO's internal encoding is actually *following* the current setting of `Encoding.default_external`. It didn't just copy it when `#set_encoding` was called. It changes whenever `Encoding.default_external` changes.
I already opened a related issue for inconsistent handling of arguments for `IO#set_encoding` (#18899). Should I move this facet of the problem to that issue instead? Should I open a separate issue altogether?
----------------------------------------
Bug #18898: IO#set_encoding with invalid arguments leads to a segfault
https://bugs.ruby-lang.org/issues/18898#change-98303
* Author: javanthropus (Jeremy Bopp)
* Status: Closed
* Priority: Normal
* ruby -v: ruby 3.1.2p20 (2022-04-12 revision 4491bb740a) [x86_64-linux]
* Backport: 2.7: UNKNOWN, 3.0: UNKNOWN, 3.1: UNKNOWN
----------------------------------------
Save the following to a file and run it:
```ruby
#!/usr/bin/env ruby
Encoding.default_external = 'utf-8'
File.open(__FILE__) do |f|
f.set_encoding('utf-8', 'invalid')
printf(
"default external: %p\ndefault internal: %p\nexternal: %p\ninternal: %p\n\n",
Encoding.default_external,
Encoding.default_internal,
f.external_encoding,
f.internal_encoding
)
f.read
end
```
The above script will result in a segfault at `f.read`. This seems to happen because the call to `#set_encoding` results in the internal encoding of the IO object being set to follow `Encoding.default_external` while also setting the external encoding of the IO object to match. Ovbiously, there shouldn't be a segfault, but I actually expected the IO object's internal encoding to be set to nil due to the invalid encoding being specified for it.
I was able to reproduce this on all versions of Ruby from 2.7.0 to 3.0.2.
--
https://bugs.ruby-lang.org/
Unsubscribe: <mailto:ruby-core-request@ruby-lang.org?subject=unsubscribe>
<http://lists.ruby-lang.org/cgi-bin/mailman/options/ruby-core>