Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Uploader of Risk Score is Unclear #291

Open
DarioS opened this issue Nov 2, 2023 · 2 comments
Open

Uploader of Risk Score is Unclear #291

DarioS opened this issue Nov 2, 2023 · 2 comments
Labels
data issue Issue with data question Further information is requested

Comments

@DarioS
Copy link

DarioS commented Nov 2, 2023

PGS000116 is anti-correlated to all of the other scores in EFO_0001645. Moreover, the journal article associated with it never mentions PGS000116, not even in supplementary text. So, it makes me think that the creator didn't upload it but someone else did. Was it mistakenly multiplied by -1 and is therefore a resilience rather than disease risk score? I also wonder about PGS003727 ...
image
Could the score overview web page have additional detail about precisely who uploaded the score?

@smlmbrt smlmbrt added the question Further information is requested label Nov 2, 2023
@smlmbrt
Copy link
Member

smlmbrt commented Nov 2, 2023

Hi @DarioS, correct many of the scores in the Catalog are curated by extracting information from the source publications. I have previously double checked the original author-reported files from their figshare and we are currently consistent with their notation. Indeed, others have noted that this score appears flipped to us and when I’ve used it in my own work I’ve used the score as-is but taken the reciprocal effect size to make it the same direction as the other CAD scores when comparing it’s performance. I’ll e-mail the authors again to double-check, but we try to keep as close to the author-reported files as possible (so the data provenance is clear).

We will consider your suggestion about whether to mark the source of the data, it it known to us internally.

@DarioS
Copy link
Author

DarioS commented Nov 2, 2023

Oh, I thought that it was author-uploaded like G.E.O. Good to know that it is usually the database maintainers instead.

@ens-lgil ens-lgil added the data issue Issue with data label Nov 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data issue Issue with data question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants