Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provenance metadata #2

Open
joshua-d-campbell opened this issue Apr 26, 2022 · 0 comments
Open

Provenance metadata #2

joshua-d-campbell opened this issue Apr 26, 2022 · 0 comments
Labels
documentation Improvements or additions to documentation

Comments

@joshua-d-campbell
Copy link
Collaborator

The goal of a set of provenance fields is to maintain a complete history of analysis steps that occured to generate a matrix or annotation. This can be broken down into an easier and harder problem:

Easier problem: Have a list of fields that describe how the current matrix was created from a previous matrix or matrices. Fields will likely include software package, version of software package, algorithm, function, or command, parameters using in the algorithm/command, and links to ids of parent matrices.

Harder problem: The ability to have a a log of all previous steps recorded. This may not be too bad for a series of matrices that get created from one another (e.g. A -> B -> C). It may be more challenging when matrices get concatenated or subsetted. For example, doublet detection and ambient RNA estimation occurs on matrices from individual samples. So the command may only apply to a subset of cells belong to one individual/sample within a matrix that has multiple samples. Or the tools could have been run on matrices for individuals first and then these matrices get concatenated later.

@joshua-d-campbell joshua-d-campbell added the documentation Improvements or additions to documentation label Apr 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

1 participant