Releases · lcpilling/ukbrapR

29 Jan 23:04

lcpilling

v0.3.0

22b1578

ukbrapR v0.3.0 Latest

Latest

New features

Suite of functions to extract and load genetic variants. Main ones of interest will be:
1. extract_variants() takes a list of variant rsIDs as input and extracts the imputed genotypes, loading to memory. This is really a wrapper around two other new functions: make_imputed_bed() and load_bed(). Also available is make_dragen_bed() to extract from whole genome sequence VCF files but this is pretty slow so usually user wants imputed variants.
2. create_pgs() creates a polygenic score (weighted allele score) using user-provided variants and weights. Loaded to memory but also saves a nicely formatted .tsv

Breaking changes

Removing dependencies: reticulate, arrow, sparklyr. These take a few previous seconds to install every time and are rarely needed. Instead will be installed if user tries to use get_rap_phenos()
get_emr_spark() removed entirely. Much better to use get_diagnoses() which has had a lot of updates to functionality and bug fixes.

Assets 2

12 Jan 21:57

lcpilling

v0.2.9

9c4d401

ukbrapR v0.2.9

Bug fixes

Fixes for issue #19 (thanks to @nsandau for the help):
1. Where OPCS searches were not always performed correctly if only OPCS3/4 codes were provided.
2. When using "group_by" in get_df() some diagnoses were incorrectly carried over between groups when different vocabs were provided for each group (condition).

Updates

Additional checking of get_diagnoses() input to abort if "blank" codes are provided to the grep.
When getting date first from self-reported illness data exclude "year" if < 1936 (earliest birth year for any participant)

Contributors

nsandau

Assets 2

07 Oct 08:57

lcpilling

v0.2.8

e0c5ab3

ukbrapR v0.2.8

Bug fixes

Baseline dates TSV is now correctly located even if user changes working directory
HES operations dates were sometimes parsed as character - this is now fixed to parse as dates

Updates

Warnings relating to parsing issues during grepping that are safe to ignore are now suppressed
Updates to documentation / examples / pkgdown site
New website articles to ascertain_diagnoses, label_fields and for spark_functions

Assets 2

30 Sep 20:38

lcpilling

v0.2.7

b1bbd56

ukbrapR v0.2.7

Updates

New function label_ukb_field() allows user to add titles and labels to UK Biobank fields provided as integers but are categorical.
New function label_ukb_fields() is a wrapper for the above. User just provides a data frame containing UK Biobank fields, and they all get formatted with titles (and labels if categorical).
Data from the UK Biobank schema (https://biobank.ctsu.ox.ac.uk/crystal/schema.cgi) are stored internally in ukbrapR:::ukb_schema
{haven} dependency added for labelling
Exported baseline_dates.tsv now also includes the assessment centres for completeness (but keeps the same filename to avoid any issues for current projects relying on already-exported files)

Assets 2

17 Sep 09:20

lcpilling

v0.2.6

1282e4f

ukbrapR v0.2.6

Bug fix

Fix for issue #10. Grep issues if user provided only Read2 or CTV3 codes, if Read2 or CTV3 were <5 characters, or if Read2/CTV3 codes contained a hyphen. Thanks to @Simon-Leyss for highlighting.
Fix for issue #11. When getting self-reported illness codes there was a problem joining the tables if user only provided cancer codes. Thanks to @LauricF for highlighting.
Fix for when both types self-reported illness codes were provided. (Incorrect subsetting to just those codes provided after pivoting the long object.)

Contributors

LauricF and Simon-Leyss

Assets 2

07 Sep 16:23

lcpilling

v0.2.5

9d45f5c

ukbrapR v0.2.5

Bug fix

When getting the date first cancer registry diagnosis, some rows were duplicated. This is now fixed so only one row per participant (the date first for any matched cancer ICD10) is returned.

Assets 2

05 Sep 14:12

lcpilling

v0.2.4

dbb3d19

ukbrapR v0.2.4

Changes

Updated internal paths for my servers indy and snow (for ongoing projects whilst we can still use local files...)
Updated how get_diagnoses() and get_df() handle a user-provided file_paths object

Assets 2

23 Aug 08:18

lcpilling

v0.2.3

34e8bfd

ukbrapR v0.2.3

Bug fix

Fix for issue #8. In moving the HES ICD10 code block below the cancer registry code I acctidently put it within the if (get_canreg) { } condition. Thanks to @LauricF for highlighting.
Fix bullet points in pkgdown version of docs

Contributors

LauricF

Assets 2

21 Aug 13:04

lcpilling

v0.2.2

8834da4

ukbrapR v0.2.2

The HESIN diagnosis search can now also include ICD9 codes in the provided codes data frame. These use fuzzy matching (similar to the ICD10s) so that searching for "280" also returns "2809" etc

Also some other minor bug fixes and internal updates

Assets 2

12 Aug 08:57

lcpilling

v0.2.1

5b98701

ukbrapR v0.2.1

Fix for issue #5. The file paths for exported tables were not correctly specified in later calls of get_diagnoses() when the working directory is not the home directory. Thanks to @LauricF for highlighting.

Contributors

LauricF

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New features

Breaking changes

Bug fixes

Updates

Contributors

Bug fixes

Updates

Updates

Bug fix

Contributors

Bug fix

Changes

Bug fix

Contributors

Contributors

Releases: lcpilling/ukbrapR

ukbrapR v0.3.0

New features

Breaking changes

ukbrapR v0.2.9

Bug fixes

Updates

Contributors

ukbrapR v0.2.8

Bug fixes

Updates

ukbrapR v0.2.7

Updates

ukbrapR v0.2.6

Bug fix

Contributors

ukbrapR v0.2.5

Bug fix

ukbrapR v0.2.4

Changes

ukbrapR v0.2.3

Bug fix

Contributors

ukbrapR v0.2.2

ukbrapR v0.2.1

Contributors