-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Hi @amcooksey - NCBI has an alpha version out for their eukaryotic annotation pipeline and are looking for feedback. It would be nice if we could test this to see if this could be an option for us in the future. This is not a huge priority now - revisit this issue whenever you have a gap. That said some of the Ag100Pest group are interested in how this goes so it would be great if you could share notes with them.
"Documentation about EGAPx are available on https://github.com/ncbi/egapx, with the egapx.py runner script available from GitHub and all software available in a Docker container for execution using either Docker or Singularity.
EGAPx currently works well for vertebrates and select arthropods (diptera, hymenoptera, lepidoptera, choleoptera), and we'll be adding support for plants and more arthropod orders in the next few months."
Could you:
- do a test run with the example data on Ceres
- Try out the pipeline with:
- Document what resources you allocated + how long it took; any other setup/execution oddities
- TBD: We'd need to compare the output with RefSeq's output. What program we use would probably depend on the extent of the changes. Perhaps https://ccb.jhu.edu/software/stringtie/gffcompare.shtml, or our gff3_merge program.