Skip to content

Handle weird character escape sequence #29

@jarredholman

Description

@jarredholman

The OOXML spec allows non-printable characters to be encoded using an escape sequence _xHHHH_ where HHHH is the hex code of the character.

I have received a few xlsx files that contain _x000D_ sequences (the carriage return character).

If you load the file in creek (or open the xml file in a text editor), the cell contents will be
"first line_x000D_\nsecond line"
Where _x000D_ is the carriage return character, and the line feed character is encoded normally.

When you open it in excel you just see a newline.

see:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions