`calculateLD.R`: Fixed incorrect SNP map #178

deepchocolate · 2023-05-24T17:22:10Z

Fixes #177
Fixes #227

Changes proposed in this pull request:

Filter genotype map by SNP RSIDs in calculateLD.R
Moved testing of LD estimation and analysis to a script of their own: tests/test_LDpred2/scripts/ld.sh
Added test of ldpred2.R using LD data filtered by --extract (SNPs) and --sample-individuals
Added script to create blocks in LD matrixes: splitLD.R
Added script to analyze and plot LD: analyzeLD.R: Example plot:

Plotting functionality modified from: privefl/bigsnpr#4

Before submitting

I've read and followed all steps in the Making a pull request
section of the CONTRIBUTING docs.
I've updated or added any relevant docstrings following the syntax described in the
Writing docstrings section of the CONTRIBUTING docs.
If this PR fixes a bug, I've added a test that will fail without my fix.
If this PR adds a new feature, I've added tests that sufficiently cover my new functionality.

…omorment/containers into deepchocolate/calculate_ld_fix

espenhgn · 2023-11-29T13:50:04Z

The "Greetings" action failure is outside our control and can be ignored: actions/first-interaction#286

espenhgn

Thanks @deepchocolate. Looks good to me, only bumped the version file + a couple minor fixes. The LD unittests failed on my end however, because of wrong file path(s). I've pushed a couple of fixes hardcoding the /ldpred2_ref mount point in calls pointing to these datas.

deepchocolate · 2024-02-05T15:10:24Z

Thanks @deepchocolate. Looks good to me, only bumped the version file + a couple minor fixes. The LD unittests failed on my end however, because of wrong file path(s). I've pushed a couple of fixes hardcoding the /ldpred2_ref mount point in calls pointing to these datas.

Great, thanks for the review! I planned to do a test run on real data prior to merging, but haven't been able to login to TSD since Friday, so leaving it hanging here until that.

espenhgn · 2024-02-05T15:18:27Z

Thanks @deepchocolate. Looks good to me, only bumped the version file + a couple minor fixes. The LD unittests failed on my end however, because of wrong file path(s). I've pushed a couple of fixes hardcoding the /ldpred2_ref mount point in calls pointing to these datas.

Great, thanks for the review! I planned to do a test run on real data prior to merging, but haven't been able to login to TSD since Friday, so leaving it hanging here until that.

Perhaps good to run a test in MoBA or similar. But can confirm, TSD p697 rhel login machines are unreachable. But you can use Windows VM -> Putty -> p697-appnode-norment01 to get access to the resource.

deepchocolate · 2024-02-13T17:37:25Z

scripts/pgs/LDpred2/fun.R

+#' @param quantile Range of estimates to keep
+getBetasAuto <- function (fitAuto, quantile=0.95, verbose=T) {
+  range <- sapply(fitAuto, function (auto) diff(range(auto$corr_est)))
+  # Keep chains that pass the filtering below


@espenhgn I noted that the problem om missing values start here (l199) and not at the quantile function below. There needs to be a na.rm=T to range as well. This is because if there are missing values in corr_est there will be missing in range and these will inevitably lead to missing values when assigning the keep variable below. Right now all chains with any missingness are thrown away, and I'm unsure if that's optimal

Not sure either.
Don't think it matters here, but perhaps best not to define a variable range replacing the base::range function within this function.

Yea, that's a good point, will change the name

na.rm=T in range causes other issues if an entire chain is missing, so will leave for now and address later.

deepchocolate · 2024-02-16T09:52:20Z

scripts/pgs/LDpred2/calculateLD.R

+# Extract individuals
+if (!is.na(extractIndividuals)) {
+  indIds <- filterFromFile(obj.bigSNP$fam, extractIndividuals, colFilter='sample.ID')
+  cat('Extracting ', sum(indIds), ' individuals\n')


This line throws an error as indIds is the same object type as obj.bigSNP$fam. sum should be changed to nrow and on line 103 indIds should have been indIds$fam$sample.ID. Will change name of indIds to something better.
This was not detected in the tests and as far as I can see this parameter does not have a test. Will add that.

espenhgn · 2024-02-21T10:43:10Z

Hi @deepchocolate; @ofrei and I discussed restructuring some of the documentation and figured it's good to merge this soon. Is it ready enough?

deepchocolate · 2024-02-21T12:34:23Z

Hi @deepchocolate; @ofrei and I discussed restructuring some of the documentation and figured it's good to merge this soon. Is it ready enough?

Yes, I can merge it today. There's still issues with calculating your own LD as it causes optimization errors during scoring. But I will leave that for now. I've tested most other things on MoBa data now and that has worked fine.

Fixed bug in calculate LD and created test

00cfcdc

deepchocolate changed the title ~~Fixed bug in calculate LD and created test~~ calculateLD.R: Fixed incorrect SNP map May 24, 2023

deepchocolate added 2 commits May 31, 2023 11:55

Work in progress

9ad0d66

Added --extract-individuals argument

dbe632e

espenhgn mentioned this pull request Jun 2, 2023

LDpred2: Error when filtering characters in chromosome column #179

Closed

deepchocolate added 8 commits June 12, 2023 13:32

Merge branch 'main' into deepchocolate/calculate_ld_fix

5558d70

Work in progress

f0e87cc

Merge branch 'main' into deepchocolate/calculate_ld_fix

33868a3

Work in progress

cf696ab

Work in progress

67fdc54

Work in progress

fca72b0

Work in progress

4abc69a

Updated readme

18dea32

espenhgn mentioned this pull request Sep 13, 2023

Collect feedback and include additional packages into R container #197

Closed

deepchocolate added 13 commits October 10, 2023 10:14

Merge branch 'main' into deepchocolate/calculate_ld_fix

4889cf5

Work in progress

0baa89c

Work in progress

df6f319

Merge branch 'deepchocolate/calculate_ld_fix' of https://github.com/c…

8fbd972

…omorment/containers into deepchocolate/calculate_ld_fix

Added testing functions

13eca9c

Work in progress

2caeeb8

Merge branch 'main' into deepchocolate/calculate_ld_fix

f3bfe3b

Added unittests of output from calculateLD.R and analyzeLD.R

f8e7f17

Commented

f596d8c

Cleanup

a6dabb8

Added coloring

98b45a1

Removed obsolete file

cd93d60

Fixe issues with LD calculation and testing

091cccc

deepchocolate added 2 commits January 10, 2024 20:05

Implemented SNP filtering function

7c850ad

Work in progress

03f9d8f

Updated changelog

9ffd405

deepchocolate requested review from espenhgn and ofrei January 31, 2024 20:41

espenhgn marked this pull request as ready for review February 1, 2024 13:26

espenhgn added the enhancement New feature or request label Feb 2, 2024

espenhgn added 4 commits February 5, 2024 12:53

bump version; couple fixes

bdd453f

fix path to ldpred2_ref data

ac10b58

assume /ldpred2_ref mount point in test scripts

a8bc085

this will be version 1.8.0

24fab86

espenhgn approved these changes Feb 5, 2024

View reviewed changes

Merge branch 'main' into deepchocolate/calculate_ld_fix

7fa39af

espenhgn mentioned this pull request Feb 9, 2024

A small bug : na.rm is missing : quantile (na.rm = T, ...) #227

Closed

deepchocolate added 4 commits February 13, 2024 12:17

Moved code into function

bfee77f

Fixed message

ed5bd4e

Added tests

1bde298

Clarified message

e85d944

deepchocolate commented Feb 13, 2024

View reviewed changes

espenhgn and others added 3 commits February 14, 2024 10:12

Merge branch 'main' into deepchocolate/calculate_ld_fix

23e7c92

Renamed variable

78f3e5e

Added test

ff7113b

deepchocolate commented Feb 16, 2024

View reviewed changes

deepchocolate added 2 commits February 21, 2024 11:24

Added file for testing filtering

31d0cc8

Fixed filtering of individuals

37137ac

Added test of filtering individuals in LD estimation

b2f536f

deepchocolate merged commit b2f536f into main Feb 21, 2024
6 checks passed

deepchocolate deleted the deepchocolate/calculate_ld_fix branch February 21, 2024 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`calculateLD.R`: Fixed incorrect SNP map #178

`calculateLD.R`: Fixed incorrect SNP map #178

deepchocolate commented May 24, 2023 •

edited

Loading

espenhgn commented Nov 29, 2023

espenhgn left a comment

deepchocolate commented Feb 5, 2024

espenhgn commented Feb 5, 2024

deepchocolate Feb 13, 2024

espenhgn Feb 14, 2024

deepchocolate Feb 14, 2024

deepchocolate Feb 16, 2024

deepchocolate Feb 16, 2024

espenhgn commented Feb 21, 2024

deepchocolate commented Feb 21, 2024

calculateLD.R: Fixed incorrect SNP map #178

calculateLD.R: Fixed incorrect SNP map #178

Conversation

deepchocolate commented May 24, 2023 • edited Loading

Before submitting

espenhgn commented Nov 29, 2023

espenhgn left a comment

Choose a reason for hiding this comment

deepchocolate commented Feb 5, 2024

espenhgn commented Feb 5, 2024

deepchocolate Feb 13, 2024

Choose a reason for hiding this comment

espenhgn Feb 14, 2024

Choose a reason for hiding this comment

deepchocolate Feb 14, 2024

Choose a reason for hiding this comment

deepchocolate Feb 16, 2024

Choose a reason for hiding this comment

deepchocolate Feb 16, 2024

Choose a reason for hiding this comment

espenhgn commented Feb 21, 2024

deepchocolate commented Feb 21, 2024

`calculateLD.R`: Fixed incorrect SNP map #178

`calculateLD.R`: Fixed incorrect SNP map #178

deepchocolate commented May 24, 2023 •

edited

Loading