Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for ENA #26

Open
TuomasBorman opened this issue Mar 28, 2024 · 4 comments
Open

Add support for ENA #26

TuomasBorman opened this issue Mar 28, 2024 · 4 comments

Comments

@TuomasBorman
Copy link
Collaborator

(Meta)-transcriptomics data is stored in European Nucleotide Archive, ENA. However, there is no counts table but only sequences (example).

As there are no counts tables, the data is not directly usable for downstream analysis. However, it still might be beneficial to have a function that fetches all the files to single directory for further processing. --> Consider adding getENAFile function.

@SandyRogers
Copy link
Member

@TuomasBorman the bit of code in the data portal itself that fetches the ENA file report, for display on the website only (not in the API) is in: portal_api.py. This might be helpful for implementing such a function in HoloFoodR...

@SandyRogers
Copy link
Member

Also, I think the metatranscriptomic raw datasets that are not yet analysed by MGnify (such as your example
PRJEB66287) could be analysed and then counts tables retrieved from MGnify.

@SandyRogers
Copy link
Member

Also, I think the metatranscriptomic raw datasets that are not yet analysed by MGnify (such as your example PRJEB66287) could be analysed and then counts tables retrieved from MGnify.

In fact these are in progress, so once complete there will be MGnify analyses of these metaT samples.

@TuomasBorman
Copy link
Collaborator Author

OK, thanks @SandyRogers! Good to know.

As this package is for fetching the data for downstream analysis, the most optimal solution is fetching the counts table from MGnify (and not raw datasets).

I will keep this issue open, and once metaT is available:

  • HoloFoodR: update examples and this warning

    # If there are samples that user wanted to include but are not present in

  • HoloFood data portal is now pointing to ENA, but it should then point also to MGnify? --> check that HoloFoodR is updated accordingly

  • MGnifyR should work already without any changes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants