Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Determination of uniformity and periodicity thresholds #3

Open
marcasriv opened this issue Jul 2, 2021 · 1 comment
Open

Determination of uniformity and periodicity thresholds #3

marcasriv opened this issue Jul 2, 2021 · 1 comment

Comments

@marcasriv
Copy link

Dear authors,

I've a question related to the procedure that you followed to classify translated and untranslated ORFs after finishing running ORQAS. How do you exactly determine the cut-off values for the uniformity and periodicity? Also, I've noticed that in Supplementary Figure 1 each different dataset presents different cutoffs, therefore do cutoffs need to be re-calculated for each dataset?

Thanks so much,

Marina

@mars13
Copy link
Contributor

mars13 commented Jul 27, 2021

To determine which values of uniformity and periodicity are indicative of an isoform being translated, we selected the set of positive controls that appear in blue in fig. 1B and sup. fig. 1. Those are ORFs with evidence of protein expression from mass spectrometry (MS), immunohistochemistry (IHC)and uniprot in all 37 tissues available the HPA. For the mouse samples, the positive controls are genes with one-to-one orthology with the human positive controls. Since every dataset can have different coverage or differences in the protocol that could affect uniformity and periodicity, we calculate in each dataset our thresholds based on the values of what we expect as translated. That said, we considered the translation of those ORFs within 90% of the periodicity and uniformity distribution of the positive controls, allowing for some variability from the samples or sequencing even in the positive set. So, yes, we recommend recalculating thresholds for each dataset and we strongly encourage it when the data comes from different experiments (i.e you can see how different are the thresholds between our glia/glioma and hippocampus datasets).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants