Escape default arrays and sequences the same other default values #620

jacobperron · 2021-09-30T01:08:55Z

Fixes #610

jacobperron · 2021-09-30T01:09:43Z

As a regression test, I've added a new field to one of the test messages: ros2/test_interface_files#16

jacobperron · 2021-09-30T01:10:49Z

Linux
Linux-aarch64
macOS
Windows

jacobperron · 2021-09-30T01:15:45Z

Rpr fails because ament/ament_cmake#352 has not been released.

sloretz · 2021-09-30T01:27:48Z

This is interesting. I can't predict if it will pass the test_communication tests. I recommend adding another test to test_interface_files with "ハローワールド" to see what it does when given a string with characters outside the latin-1 character set.

https://github.com/ros2/test_interface_files/blob/ea4d4f33eca97f37b4294e6ab012fa0f216de609/msg/WStrings.msg#L2-L4

IIUC "Hellö wörld!" in that .msg file is encoded as the UTF-8 bytes b'Hell\xc3\xb6 w\xc3\xb6rld!'. When read as latin-1, the string in Python would be very different from the original.

>>> b'Hell\xc3\xb6 w\xc3\xb6rld!'.decode('latin-1')
'HellÃ¶ wÃ¶rld!'

Re-encoding that string as latin-1 should reproduce the original bytes, but there's some more weirdness when the generators read the .idl files:

rosidl/rosidl_parser/rosidl_parser/parser.py

Line 660 in 36ed120

return codecs.decode(match.group(0), 'unicode-escape')
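
To make the round trip described above concrete, here is a minimal sketch, not taken from rosidl itself. The variable names are illustrative only, and the last step assumes CPython's behaviour of encoding a `str` to UTF-8 before applying the `unicode-escape` codec.

```python
import codecs

original = 'Hellö wörld!'
utf8_bytes = original.encode('utf-8')  # the bytes stored in the .msg file

# Reading UTF-8 bytes as latin-1 never raises, because every byte value maps
# to exactly one code point, but the resulting text is mojibake.
misread = utf8_bytes.decode('latin-1')

# The misreading is lossless: re-encoding as latin-1 restores the bytes.
roundtrip = misread.encode('latin-1')

# The same byte-level round trip works for text that latin-1 itself cannot
# represent, since the decode step only ever sees individual bytes.
japanese = 'ハローワールド'
japanese_roundtrip = (
    japanese.encode('utf-8').decode('latin-1').encode('latin-1').decode('utf-8')
)

# Handing a str to the 'unicode-escape' codec is a separate hazard: the str
# is encoded to UTF-8 first and each byte then becomes one code point, so
# non-ASCII text is mangled even when it was read correctly.
escaped_view = codecs.decode('ö', 'unicode-escape')

print(repr(misread))
print(roundtrip == utf8_bytes)
print(japanese_roundtrip == japanese)
print(repr(escaped_view))
```

The sketch only shows that the latin-1 step is reversible at the byte level. It does not predict what the generators do with the mojibake string in between.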

jacobperron · 2021-10-05T01:03:42Z

> IIUC "Hellö wörld!" in that .msg file is encoded as the UTF-8 bytes b'Hell\xc3\xb6 w\xc3\xb6rld!'. When read as latin-1, the string in Python would be very different from the original.

In fact there were test failures related to this that I missed locally: https://ci.ros2.org/job/ci_linux/15332/testReport/junit/rosidl_generator_py.test/test_interfaces/test_wstrings/

Commit 8253770: Fix #610. Apply the same encode/decode pattern and escaping as for other default values. Signed-off-by: Jacob Perron

jacobperron · 2021-10-14T20:05:36Z

I think the issue was due to a difference in how we handled default values of arrays and sequences compared with other default values. See 8253770, which applies similar logic to array/sequence defaults as we do with other defaults, e.g.

rosidl/rosidl_adapter/rosidl_adapter/msg/__init__.py

Lines 78 to 79 in 36ed120

    
    estr = string.encode().decode('unicode_escape')
    estr = estr.replace('"', r'\"')
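
As a rough illustration of what "the same logic for array/sequence defaults" could look like, the sketch below applies those two lines to every element of a list. It is not the code from 8253770. The helper names `escape_default` and `escape_default_sequence` are hypothetical, and the sketch assumes the elements have already been split into individual strings.

```python
from typing import List


def escape_default(value: str) -> str:
    """Apply the two steps quoted above to a single default value."""
    # Interpret backslash escape sequences written in the interface file.
    estr = value.encode().decode('unicode_escape')
    # Escape double quotes so the value can sit inside a quoted literal.
    return estr.replace('"', r'\"')


def escape_default_sequence(values: List[str]) -> List[str]:
    """Hypothetical helper: treat each array/sequence element the same way."""
    return [escape_default(v) for v in values]


print(escape_default_sequence(['plain', 'say "hi"', r'tab\there']))
```

Running every element through the same helper as scalar defaults keeps both code paths consistent, which is the difference the comment above points to.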

Linux
Linux-aarch64
macOS
Windows

jacobperron requested a review from sloretz September 30, 2021 01:08

jacobperron mentioned this pull request Sep 30, 2021

Add array of wstring with default value ros2/test_interface_files#16

Open

jacobperron mentioned this pull request Sep 30, 2021

Colcon build fails with UnicodeDecodeError for WstringArrays in Foxy, but successful in dashing #610

Open

jacobperron self-assigned this Oct 5, 2021

jacobperron force-pushed the jacob/rolling_fix_610 branch from 548af14 to 8253770 Compare October 14, 2021 20:02

jacobperron changed the title ~~Use latin-1 encoding for reading interface file content~~ Escape default arrays and sequences the same other default values Oct 14, 2021

audrow changed the base branch from master to rolling June 28, 2022 14:23

jacobperron removed their assignment May 9, 2023
