Skip to content

[Question] Why do Document-Based Scatterplots need category? #44

@fredguth

Description

@fredguth

Sorry to ask via issue tracker, tried to find the answer in the referred arxived article and did not know of any other better channel.

I am trying to figure out how the Document-Based Scatterplot works.

I get that it uses Tf-Idf on unigrams of the text and takes the 2 first unigrams of the vector (the most different terms?) as axis. But what function is applied to each document to find its x-y position? Its "nearess" to each term?

Besides, I don't understand why we need to provide Category in this case. I understood it uses category to colorize the points, but anything else? Because if it's just that, it seems a hard constraint to Document-Based Scatterplot for something one may not need. But I guess I am missing something.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions