Feat/test data for lab and health insurance #19

signekb · 2023-12-08T11:54:57Z

This pr includes the following:

The addition of create_test_lab_df() in create_test_data.R
The creation of lab_df in create_test_data.R using the create_test_lab_df(). lab_df has 100 rows (1 row per individual)
The addition of a create_test_health_insurance_df() function in create_test_data.R
The creation of health_insurance_df in in create_test_data.R using the create_test_health_insurance_df(). health_insurance_df has 100 rows (1 row per individual)

Can you check whether these dfs follow your descriptions in #4, @Aastedet?
From the description, I'm not sure whether there should be multiple rows per individual? Currently, both functions sample from 001-100 with replacement, meaning that with 100 samples some ID's will appear multiple times and some ID's in the range will not appear. Is that what you imagined?

Do we maybe want to keep the test data functions and creation of the test data in separate scripts?

…les > 100

this way it's clear that we set it for all test datasets and not only medication data

…est_data.R and delete empty functions.R

#Conflicts: # DESCRIPTION

data-raw/testdata.R

lwjohnst86 · 2023-12-12T19:02:05Z

data-raw/testdata.R

+    replace = TRUE
+  )),
+  # Number of packages
+  apk = sample(1:3, 1000, replace = TRUE),


What is this for?

number of packages ("antal pakker", the slighly cryptic variable name on dst is apk) purchased is factored in when calculating the number of doses of insulin vs. non-insulin when classifying type 1 from type 2 diabetes.

data-raw/testdata.R

lwjohnst86 · 2023-12-12T19:33:50Z

In general, for creating data used within the package, the code to create it should be not included as part of the package (since it might include dependencies that aren't actually needed by the package for intended uses). That's why I moved the code over into data-raw/ (set up using the usethis::use_data_raw()).

signekb · 2023-12-13T10:24:10Z

@lwjohnst86 Your comments seem to be mostly on the medication test data, which is actually not a part of this PR (test data for lab and health insurance).
But I guess @Aastedet can take a look at the comments, since I don't really have an overview of what's happening in that part of the script either :)
I will add @Aastedet as assignee for this PR (assignee = actively working on the PR and being responsible for getting it into a merge-ready state).

lwjohnst86 · 2023-12-13T13:24:06Z

@signekb I didn't realize until later that it was added by Anders (?) earlier, since the PR showed it all coming from you ☺️ though I did know that you hadn't added that code when I made the comments because of reading the #4 issue ☺️

signekb · 2023-12-13T13:57:36Z

Totally fine - I understand the confusion 👍

signekb · 2024-01-31T12:11:01Z

@Aastedet @lwjohnst86 Status on this? Anything I can do for this to become ready to merge?

lwjohnst86 · 2024-02-01T20:01:43Z

@signekb We'll get to this when we start the focus period next week. More likely it will be me and you figuring things out and getting feedback from @Aastedet during meetings.

added assign_drugname_from_atc() to med_a10_df

Fix to previous commit to assign drugnames to med_a10_df

forgot to actually assign drug names to med_a10_df

data-raw/testdata.R

- Added offset to pnr number generation to have more control when generating data for false-positive diabetes cases (for medication: 1-200: non-cases, 201-250: true cases). - Increased number of samples in health insurance/lab data and changed years covered by health insurance to match real world setting.

Merge commit '61343b054d2a190eb1e20de6bcb1265c10d2ac34' #Conflicts: # .Rbuildignore # DESCRIPTION

Aastedet

This looks good! Next step is for me to add the hospital diagnosis (lpr) and population data (bef).

Aastedet · 2024-02-20T13:07:35Z

Are we waiting for me to add the fake diagnosis data (I will, I promise 😄 ) before merging or do I need to do more in terms of reviewing the PR?

lwjohnst86 · 2024-02-20T15:28:39Z

@Aastedet there are still a lot of questions I have and the code needs a lot of work, that's why I haven't merged it in yet. I think it would be better to first do #32 before creating the example/test data, because it would make it easier to build those if we have the variable list well defined and set up first.

Aastedet · 2024-02-22T12:49:03Z

@lwjohnst86 Cool - I'll get #32 done ASAP.

signekb added 3 commits December 6, 2023 20:17

feat: add functions script with create_test_lab_df()

652aaca

feat: create test lab_df using create_test_lab_df()

e6baf5b

fix: change pnr to only include 001-100 independent of num_samples

d930fc3

signekb linked an issue Dec 8, 2023 that may be closed by this pull request

Create a fake dataset to test that the functions work #4

Closed

signekb marked this pull request as draft December 8, 2023 11:56

signekb added 6 commits December 8, 2023 13:11

style: edit pnr comment to clarify it's only 001-100 even if num_samp…

88f0b40

…les > 100

style: update comments in create_test_lab_df

3c04bd8

feat: add create_test_hi_df()

a6e07a8

feat: create test health insurance df using. create_test_hi_df()

bbb2910

style: remove old parenthesis from comment

17e40d2

fix: move set.seed up

e09b902

this way it's clear that we set it for all test datasets and not only medication data

signekb requested a review from Aastedet December 8, 2023 13:04

signekb self-assigned this Dec 8, 2023

signekb marked this pull request as ready for review December 8, 2023 13:05

signekb and others added 3 commits December 10, 2023 13:21

refactor: move functions to create test lab and hi data into create_t…

ea8e089

…est_data.R and delete empty functions.R

chore: add setup for making fake data using usethis::use_data_raw().

2fc5655

chore: Moved code over into data-raw folder

17dfea6

lwjohnst86 mentioned this pull request Dec 12, 2023

Create a fake dataset to test that the functions work #4

Closed

lwjohnst86 added 3 commits December 12, 2023 20:00

refactor: Started refactoring but not sure what output should be.

4eb8947

Merge commit 'bb0cb57ecd28ec4c96f98412926174c1cc74d25e'

81594c2

#Conflicts: # DESCRIPTION

refactor: create function to make pnr, plus other small edits

d35dc94

lwjohnst86 requested changes Dec 12, 2023

View reviewed changes

signekb assigned Aastedet Dec 13, 2023

Update testdata.R

f4e242e

added assign_drugname_from_atc() to med_a10_df

Aastedet added 2 commits February 13, 2024 12:05

Update testdata.R

cb9100f

Fix to previous commit to assign drugnames to med_a10_df

Update testdata.R

f5395c6

forgot to actually assign drug names to med_a10_df

signekb commented Feb 16, 2024

View reviewed changes

data-raw/testdata.R Outdated Show resolved Hide resolved

Anders Aasted Isaksen added 2 commits February 17, 2024 23:30

Fixed merge conflicts after pull from main

387454a

Merge commit '61343b054d2a190eb1e20de6bcb1265c10d2ac34' #Conflicts: # .Rbuildignore # DESCRIPTION

Aastedet approved these changes Feb 17, 2024

View reviewed changes

Aastedet requested a review from lwjohnst86 February 17, 2024 23:03

lwjohnst86 merged commit b645de4 into main Apr 27, 2024
0 of 2 checks passed

lwjohnst86 deleted the feat/test-data-for-lab-and-health-insurance branch April 27, 2024 17:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/test data for lab and health insurance #19

Feat/test data for lab and health insurance #19

signekb commented Dec 8, 2023 •

edited

Loading

lwjohnst86 Dec 12, 2023

Aastedet Feb 17, 2024

lwjohnst86 commented Dec 12, 2023

signekb commented Dec 13, 2023

lwjohnst86 commented Dec 13, 2023

signekb commented Dec 13, 2023

signekb commented Jan 31, 2024

lwjohnst86 commented Feb 1, 2024

Aastedet left a comment

Aastedet commented Feb 20, 2024

lwjohnst86 commented Feb 20, 2024

Aastedet commented Feb 22, 2024

Feat/test data for lab and health insurance #19

Feat/test data for lab and health insurance #19

Conversation

signekb commented Dec 8, 2023 • edited Loading

lwjohnst86 Dec 12, 2023

Choose a reason for hiding this comment

Aastedet Feb 17, 2024

Choose a reason for hiding this comment

lwjohnst86 commented Dec 12, 2023

signekb commented Dec 13, 2023

lwjohnst86 commented Dec 13, 2023

signekb commented Dec 13, 2023

signekb commented Jan 31, 2024

lwjohnst86 commented Feb 1, 2024

Aastedet left a comment

Choose a reason for hiding this comment

Aastedet commented Feb 20, 2024

lwjohnst86 commented Feb 20, 2024

Aastedet commented Feb 22, 2024

signekb commented Dec 8, 2023 •

edited

Loading