Create md5 sums for data upload to public repositories #32

Open
drpatelh opened this issue Apr 13, 2019 · 0 comments
Labels
enhancement New feature or request

Comments

@drpatelh
Member

drpatelh commented Apr 13, 2019

Public data repositories such as GEO require MD5 checksums to be included in the metasheet containing the experimental details. Generating these after the fact can be quite painful, but it's something that can be automated within the pipeline. I'd imagine the checksums would be generated for the raw FastQ files and for the processed BAM, peak and bigWig files. We would also have to parse the Picard insert size metrics file to get the mean insert size and its standard deviation. These could all be collected at the end of the pipeline and written to a TSV file whose contents could then be copied and pasted into the metasheet as appropriate.
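
As a rough illustration of what such a collection step might look like, here is a minimal Python sketch. It is not part of the pipeline: the function names (`md5sum`, `parse_insert_size_metrics`, `write_checksum_table`) and the two-column TSV layout are hypothetical, and the parser assumes the usual Picard `CollectInsertSizeMetrics` layout, where a `## METRICS CLASS` line is followed by a tab-separated header containing `MEAN_INSERT_SIZE` and `STANDARD_DEVIATION`.

```python
import csv
import hashlib
from pathlib import Path


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_insert_size_metrics(path):
    """Return (mean, std dev) from a Picard insert size metrics file.

    Assumes the header row directly follows the '## METRICS CLASS' line.
    If Picard reports several rows (one per pair orientation), only the
    first row is used.
    """
    with open(path) as handle:
        lines = [line.rstrip("\n") for line in handle]
    for i, line in enumerate(lines):
        if line.startswith("## METRICS CLASS"):
            header = lines[i + 1].split("\t")
            values = lines[i + 2].split("\t")
            row = dict(zip(header, values))
            return float(row["MEAN_INSERT_SIZE"]), float(row["STANDARD_DEVIATION"])
    raise ValueError(f"no metrics section found in {path}")


def write_checksum_table(files, out_tsv):
    """Write a two-column TSV (file name, MD5) for the given paths (hypothetical layout)."""
    with open(out_tsv, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t")
        writer.writerow(["file", "md5"])
        for path in files:
            writer.writerow([Path(path).name, md5sum(path)])
```

In a pipeline, the checksum for each file would more likely be computed in the process that produces it and the results merged at the end. The single-script form above is only meant to show the pieces involved.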
