Any pull request that does not include enough information to be reviewed in a timely manner may be closed at the maintainers' discretion.
Requirements
Description
New xref mapping pipeline written in Nextflow and Python, as well as fixes and improvements to the download pipeline.
Use case
This PR is the next step in refactoring and rewriting the xref pipelines in Nextflow and Python. The download pipeline came first, and this PR adds the mapping pipeline. The general flow of the pipeline is unchanged. Various improvements and fixes for long-standing issues have been added, along with a test suite that does not yet cover every module. The Confluence page for the new xref pipelines has been updated.
The new pipeline has been tested on multiple vertebrate and plant species, both independently and together, and works as expected. It has also been tested with human and mouse (along with the modified download pipeline for gencode), though a few issues are still being checked.
Benefits
The xref mapping pipeline now runs on Nextflow and Python, includes fixes for long-standing issues, and has a test suite.
Possible Drawbacks
Refer to the "Known Issues" section of the Confluence page.
Testing
Dependencies