Skip to content
This repository was archived by the owner on Oct 21, 2020. It is now read-only.
This repository was archived by the owner on Oct 21, 2020. It is now read-only.

XML Parser cannot import a GEXF generated with a node which domain_title the value created from a <title> with &#1; (in the HTML version)) #216

@DavidBruant

Description

@DavidBruant

Found a page with <title>&#1;</title> (and other characters in the HTML but they're likely irrelevant). Let's try to make a reduce test case out of this.

Tentative test case for Gephi (maybe Github will mess up encoding. Important part is what is before the a in the attribute value):

<?xml version="1.0" encoding="UTF-8"?>
<gexf xmlns="http://www.gexf.net/1.2draft" version="1.2">
    <graph mode="static" defaultedgetype="directed">
        <attributes class="node">
            <attribute id="domain_title" title="domain_title" type="string"/>
        </attributes>
        <attributes class="edge">
            <attribute id="weight" title="weight" type="integer"/>
        </attributes>
        <nodes>
            <node id="n1" label="n1">
                <attvalues>
                    <attvalue for="domain_title" value="�a"/>
                <attvalues>
            </node>
        </nodes>
        <edges>
        </edges>
    </graph>
</gexf>

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions