Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to import a large file (?) #1841

Closed
Manaswwm opened this issue Nov 14, 2017 · 11 comments
Closed

Unable to import a large file (?) #1841

Manaswwm opened this issue Nov 14, 2017 · 11 comments

Comments

@Manaswwm
Copy link

I have imported a large network in python (1000+ nodes and about 10000+ edges), I have processed it with networkx package and have exported the data in a '.graphml' file (file size ~4 MB). This file also contains information on color of nodes that are present in my network(as it is absolutely essential for me to be able to distinguish between some nodes).

When I try to open this file using Gephi, the screen does not show anything after I select the file i.e File -> Open -> My_Graphml_file.graphml , following which I do not get a thing in all the three tabs.

Surprisingly, even if I try to import some other file which does not have any color information and about the same size as the one I described above, it still does not show a thing in all the three tabs. But, if I try to import file with small size, I get something on my screen to work with.

So is it the size problem and if it is then how do I solve it?

@eduramiba
Copy link
Member

Can you share your file (or a minimal example file) for reproducing the problem?

@Manaswwm
Copy link
Author

I think the editor does not support uploading file formats .graphml and .gexf (?)

I am sharing the Dropbox links for those files :
https://www.dropbox.com/s/jryrxy9fz4pjl1u/networkx_graphml.graphml?dl=0 -- this would be the file containing no node color information, but still would not open

https://www.dropbox.com/s/lhztjls2o5tusnd/networkx_gexf.gexf?dl=0 -- this would be the file containing the node color information and would not open

Thanks in advance for help!

@eduramiba
Copy link
Member

Thanks, I will check them when possible. Uploading the files as zip here
files.zip

@Manaswwm
Copy link
Author

If it helps: I get the following exception thrown when I try to upload the large network -

java.lang.IllegalArgumentException: The id can't be empty
at org.gephi.io.importer.impl.ImportContainerImpl.checkId(ImportContainerImpl.java:1035)
at org.gephi.io.importer.impl.ImportContainerImpl.nodeExists(ImportContainerImpl.java:209)
at org.gephi.io.importer.plugin.file.ImporterGEXF.readNode(ImporterGEXF.java:364)
at org.gephi.io.importer.plugin.file.ImporterGEXF.execute(ImporterGEXF.java:153)
at org.gephi.io.importer.impl.ImportControllerImpl.importFile(ImportControllerImpl.java:199)
at org.gephi.io.importer.impl.ImportControllerImpl.importFile(ImportControllerImpl.java:169)
at org.gephi.desktop.importer.DesktopImportControllerUI$4.run(DesktopImportControllerUI.java:341)
Caused: java.lang.RuntimeException
at org.gephi.desktop.importer.DesktopImportControllerUI$4.run(DesktopImportControllerUI.java:349)
[catch] at org.gephi.utils.longtask.api.LongTaskExecutor$RunningLongTask.run(LongTaskExecutor.java:274)
at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
at java.util.concurrent.FutureTask.run(Unknown Source)
at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
at java.lang.Thread.run(Unknown Source)

@Manaswwm
Copy link
Author

Short update:
I dropped the idea of coloring nodes, I just wanted to visualize the large network (which has 1226 nodes and 50976 edges). I constructed a .graphml file using python(networkx) for the same and now I try opening it with Gephi (and it does not open). As a control to check whether my network has some error, I try opening the same .graphml file in Cytoscape and it works.
The only reason why I want to open network in Gephi instead of Cytoscape is that the layout options in former are better than the later.

@Manaswwm
Copy link
Author

I figured out the solution to this problem:

I opened the .graphml file that I obtained from python in Cytoscape, then without modifying I exported .graphml from Cytoscape to Gephi and now I am able to visualize the network.

@BonnieSchrack
Copy link

I don't think the large file size is so much the problem, as the file type. I import CSV files of thousands of nodes and over 100000 edges, not that Gephi is happy about excessive numbers of edges, but CSV seems to be a Gephi-friendly file type.

@Yalaoui-Bilal
Copy link

what is the graph size limitations for Gephi ?

@graemediack
Copy link

You could try to use networkx.write_gml instead of .write_graphml, this worked for me after I was having a similiar problem!

@Manaswwm
Copy link
Author

Manaswwm commented Dec 13, 2018 via email

@mbastian mbastian added this to the 0.9.3 milestone Jan 22, 2022
@mbastian mbastian assigned mbastian and unassigned eduramiba Jan 22, 2022
mbastian added a commit that referenced this issue Jan 22, 2022
@mbastian
Copy link
Member

This issue has been fixed and will be included in the upcoming 0.9.3 release. Thank you for the report!

The root cause was because of a node id being a whitespace " " in your file example. As long as the id is not empty it should work but we had a bug that was trimming the id and therefore it would be empty.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants