Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closes #2481 bug the result of derive param tte depends on the sort order of the input #2569

Open
wants to merge 32 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 30 commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
8e23388
Added order arguments to censor_source and event_source. Also added s…
ProfessorP-beep Nov 18, 2024
cd52801
Added order argument to tte_source as part of development and error f…
ProfessorP-beep Nov 18, 2024
2727736
Fixed previous erros but still need to address failed tests for Test …
ProfessorP-beep Nov 18, 2024
9e86217
added check_type arg_match to derive_param_tte so user has to input a…
ProfessorP-beep Nov 18, 2024
d97377c
Changed position of signal_duplicate_records function in derive_param…
ProfessorP-beep Nov 18, 2024
fa49a51
lintr changes by removing whitespace.
ProfessorP-beep Nov 18, 2024
01e8f5a
styler fix.
ProfessorP-beep Nov 18, 2024
53457c2
updated NEWS.md with changes to derive_param_tte,. Removed Test 17 fr…
ProfessorP-beep Nov 19, 2024
020c9d7
Merge branch 'main' into 2481-bug-the-result-of-derive_param_tte-depe…
ProfessorP-beep Nov 19, 2024
dccdbe1
changed the signal_duplicate_records within derive_parame_tte to hand…
ProfessorP-beep Nov 19, 2024
8006891
Merge branch '2481-bug-the-result-of-derive_param_tte-depends-on-the-…
ProfessorP-beep Nov 19, 2024
4c95243
added a tryCatch() to filter_date_sources to catch duplicates to addr…
ProfessorP-beep Nov 21, 2024
087c0f3
Moved duplication check to filter_date_sources in tryCatch() and rewr…
ProfessorP-beep Nov 24, 2024
4405868
1. Moved updates in News section to admiral dev section
ProfessorP-beep Dec 3, 2024
21b5a00
Ran styler, lintr fixes, and devtools check.
ProfessorP-beep Dec 3, 2024
ce07ad1
styler changes
ProfessorP-beep Dec 3, 2024
1d4e6b7
accepted snapshots from testthat and addressed bds_tte.Rmd error for …
ProfessorP-beep Dec 3, 2024
22f3f2d
added documentation for order and check_type arguments added to funct…
ProfessorP-beep Dec 3, 2024
47637a5
requested updates to documentation and test script for derive_param_tte
ProfessorP-beep Dec 16, 2024
e882758
corrected documentation and removed rlang from bds_tte.Rmd
ProfessorP-beep Dec 17, 2024
e5c28fc
updated derive_param_tte documentation and added test to derive_param…
ProfessorP-beep Dec 20, 2024
404c949
fixed spelling error
ProfessorP-beep Dec 20, 2024
ae70492
updates to derive_param_tte documentation and test examples.
ProfessorP-beep Dec 23, 2024
34d2fb3
Update NEWS.md
ProfessorP-beep Jan 8, 2025
65c58ee
Merge branch 'main' into 2481-bug-the-result-of-derive_param_tte-depe…
bms63 Jan 9, 2025
2a3cf6c
update to derive_param_tte test, function examples, and documentation.
ProfessorP-beep Jan 9, 2025
dbbb5ab
Merge branch '2481-bug-the-result-of-derive_param_tte-depends-on-the-…
ProfessorP-beep Jan 9, 2025
1cb81bc
snapshots accepted
ProfessorP-beep Jan 9, 2025
2aeaf29
passed local checks. Pushing again
ProfessorP-beep Jan 10, 2025
a95aa68
ran styler
ProfessorP-beep Jan 10, 2025
0a1d621
added "message" as a option for check_type in derive_var_obs_number
ProfessorP-beep Jan 10, 2025
e94e3eb
Update NEWS.md
ProfessorP-beep Jan 11, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 6 additions & 0 deletions NEWS.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,12 @@ target range. (#2571)
- Update `ADEG` template to flag `ABLFL` and `ANL01FL` based on `DTYPE == "AVERAGE"` records. (#2561)

## Updates of Existing Functions
- added `"message"` as option for `check_type` argument in `filter_extreme()` function. (#2481)

- Users can now specify how duplicate records are handled in `derive_param_tte()` using the `check_type` argument, with options including `"error"`, `"warning"`, `"message"`, or `"none"`, allowing for greater flexibility in managing duplicate data scenarios. (#2481)

- `order` argument has been added to `event_source()` and `censor_source()` and
defaulted to `NULL` to allow specifying variables in addition to the date variable. This can be used to ensure the uniqueness of the select records if there is more than one record per date. (#2481)

- NCICTCAEv5 grading criteria (`atoxgr_criteria_ctcv5`):

Expand Down
8 changes: 4 additions & 4 deletions R/derive_merged.R
Original file line number Diff line number Diff line change
Expand Up @@ -112,15 +112,15 @@
#'
#' @param check_type Check uniqueness?
#'
#' If `"warning"` or `"error"` is specified, the specified message is issued
#' If `"warning"`, `"message"`, or `"error"` is specified, the specified message is issued
#' if the observations of the (restricted) additional dataset are not unique
#' with respect to the by variables and the order.
#'
#' If the `order` argument is not specified, the `check_type` argument is ignored:
#' if the observations of the (restricted) additional dataset are not unique with respect
#' to the by variables, an error is issued.
#' if the observations of the (restricted) additional dataset are not unique with respect
#' to the by variables, an error is issued.
#'
#' *Permitted Values*: `"none"`, `"warning"`, `"error"`
#' *Permitted Values*: `"none"`, `"message"`, `"warning"`, `"error"`
#'
#' @param duplicate_msg Message of unique check
#'
Expand Down
159 changes: 129 additions & 30 deletions R/derive_param_tte.R
Original file line number Diff line number Diff line change
Expand Up @@ -60,6 +60,15 @@
#'
#' A list of symbols created using `exprs()` is expected.
#'
#' @param check_type Check uniqueness
#'
#' If `"warning"`, `"message"`, or `"error"` is specified, the specified message is issued
#' if the observations of the source datasets are not unique with respect to the
#' by variables and the date and order specified in the `event_source()` and
#' `censor_source()` objects.
#'
#' *Permitted Values*: `"none"`, `"message"`, `"warning"`, `"error"`
#'
#' @details The following steps are performed to create the observations of the
#' new parameter:
#'
Expand Down Expand Up @@ -262,10 +271,10 @@
#' mutate(STUDYID = "AB42")
#'
#' ae <- tribble(
#' ~USUBJID, ~AESTDTC, ~AESEQ, ~AEDECOD,
#' "01", "2021-01-03T10:56", 1, "Flu",
#' "01", "2021-03-04", 2, "Cough",
#' "01", "2021", 3, "Flu"
#' ~USUBJID, ~AESTDTC, ~AESEQ, ~AEDECOD,
#' "01", "2021-01-03T10:56", 1, "Flu",
#' "01", "2021-03-04", 2, "Cough",
#' "01", "2021", 3, "Flu"
#' ) %>%
#' mutate(STUDYID = "AB42")
#'
Expand Down Expand Up @@ -313,6 +322,50 @@
#' )
#' ) %>%
#' select(USUBJID, STARTDT, PARAMCD, PARAM, ADT, CNSR, SRCSEQ)
#'
#' # Resolve tie when serious AE share a date by sorting with order argument
#' adsl <- tribble(
#' ~USUBJID, ~TRTSDT, ~EOSDT,
#' "01", ymd("2020-12-06"), ymd("2021-03-06"),
#' "02", ymd("2021-01-16"), ymd("2021-02-03")
#' ) %>% mutate(STUDYID = "AB42")
#'
#' ae <- tribble(
#' ~USUBJID, ~AESTDTC, ~AESEQ, ~AESER, ~AEDECOD,
#' "01", "2021-01-03", 1, "Y", "Flu",
#' "01", "2021-01-03", 2, "Y", "Cough",
#' "01", "2021-01-20", 3, "N", "Headache",
#' ) %>% mutate(
#' AESTDT = ymd(AESTDTC),
#' STUDYID = "AB42"
#' )
#'
#' derive_param_tte(
#' dataset_adsl = adsl,
#' start_date = TRTSDT,
#' source_datasets = list(adsl = adsl, ae = ae),
#' event_conditions = list(event_source(
#' dataset_name = "ae",
#' date = AESTDT,
#' set_values_to = exprs(
#' EVENTDESC = "Serious AE",
#' SRCSEQ = AESEQ
#' ),
#' filter = AESER == "Y",
#' order = exprs(AESEQ)
#' )),
#' censor_conditions = list(censor_source(
#' dataset_name = "adsl",
#' date = EOSDT,
#' censor = 1,
#' set_values_to = exprs(EVENTDESC = "End of Study")
#' )),
#' set_values_to = exprs(
#' PARAMCD = "TTSAE",
#' PARAM = "Time to First Serious AE"
#' )
#' )
#'
derive_param_tte <- function(dataset = NULL,
dataset_adsl,
source_datasets,
Expand All @@ -322,8 +375,14 @@ derive_param_tte <- function(dataset = NULL,
censor_conditions,
create_datetime = FALSE,
set_values_to,
subject_keys = get_admiral_option("subject_keys")) {
subject_keys = get_admiral_option("subject_keys"),
check_type = "warning") {
# checking and quoting #
check_type <- assert_character_scalar(
check_type,
values = c("warning", "message", "error", "none"),
case_sensitive = FALSE
)
assert_data_frame(dataset, optional = TRUE)
assert_vars(by_vars, optional = TRUE)
start_date <- assert_symbol(enexpr(start_date))
Expand Down Expand Up @@ -373,16 +432,17 @@ derive_param_tte <- function(dataset = NULL,
by_vars = by_vars
)
}

tmp_event <- get_new_tmp_var(dataset)

# determine events #
event_data <- filter_date_sources(
sources = event_conditions,
source_datasets = source_datasets,
by_vars = by_vars,
create_datetime = create_datetime,
subject_keys = subject_keys,
mode = "first"
mode = "first",
check_type = check_type
) %>%
mutate(!!tmp_event := 1L)

Expand All @@ -393,7 +453,8 @@ derive_param_tte <- function(dataset = NULL,
by_vars = by_vars,
create_datetime = create_datetime,
subject_keys = subject_keys,
mode = "last"
mode = "last",
check_type = check_type
) %>%
mutate(!!tmp_event := 0L)

Expand Down Expand Up @@ -436,7 +497,8 @@ derive_param_tte <- function(dataset = NULL,
bind_rows(event_data, censor_data),
by_vars = expr_c(subject_keys, by_vars),
order = exprs(!!tmp_event),
mode = "last"
mode = "last",
check_type = check_type
) %>%
inner_join(
adsl,
Expand Down Expand Up @@ -505,6 +567,15 @@ derive_param_tte <- function(dataset = NULL,
#'
#' Permitted Values: `"first"`, `"last"`
#'
#' @param check_type Check uniqueness
#'
#' If `"warning"`, `"message"`, or `"error"` is specified, the specified message is issued
#' if the observations of the source datasets are not unique with respect to the
#' by variables and the date and order specified in the `tte_source()` objects.
#'
#' Default: `"none"`
#' Permitted Values: `"none"`, `"warning"`, `"error"`, `"message"`
#'
ProfessorP-beep marked this conversation as resolved.
Show resolved Hide resolved
#' @details The following steps are performed to create the output dataset:
#'
#' \enumerate{ \item For each source dataset the observations as specified by
Expand All @@ -529,7 +600,7 @@ derive_param_tte <- function(dataset = NULL,
#' @return A dataset with one observation per subject as described in the
#' "Details" section.
#'
#' @noRd
#' @keywords internal
ProfessorP-beep marked this conversation as resolved.
Show resolved Hide resolved
#'
#' @examples
#' library(tibble)
Expand Down Expand Up @@ -565,20 +636,22 @@ derive_param_tte <- function(dataset = NULL,
#' )
#' )
#'
#' filter_date_sources(
#' admiral:::filter_date_sources(
#' sources = list(ttae),
#' source_datasets = list(adsl = adsl, ae = ae),
#' by_vars = exprs(AEDECOD),
#' create_datetime = FALSE,
#' subject_keys = get_admiral_option("subject_keys"),
#' mode = "first"
#' mode = "first",
#' check_type = "none"
#' )
filter_date_sources <- function(sources,
source_datasets,
by_vars,
create_datetime = FALSE,
subject_keys,
mode) {
mode,
check_type = "none") {
ProfessorP-beep marked this conversation as resolved.
Show resolved Hide resolved
assert_list_of(sources, "tte_source")
assert_list_of(source_datasets, "data.frame")
assert_logical_scalar(create_datetime)
Expand Down Expand Up @@ -613,17 +686,31 @@ filter_date_sources <- function(sources,
var = !!source_date_var,
dataset_name = sources[[i]]$dataset_name
)
data[[i]] <- source_dataset %>%
filter_if(sources[[i]]$filter) %>%
filter_extreme(
order = exprs(!!source_date_var),
by_vars = expr_c(subject_keys, by_vars),
mode = mode,
check_type = "none"
)

# wrap filter_extreme in tryCatch to catch duplicate records and create a message
data[[i]] <- rlang::try_fetch(
{
source_dataset %>%
filter_if(sources[[i]]$filter) %>%
filter_extreme(
order = expr_c(exprs(!!source_date_var), sources[[i]]$order),
by_vars = expr_c(subject_keys, by_vars),
mode = mode,
check_type = check_type
ProfessorP-beep marked this conversation as resolved.
Show resolved Hide resolved
)
},
duplicate_records = function(cnd) {
cnd_funs <- list(message = cli_inform, warning = cli_warn, error = cli_abort)
cnd_funs[[check_type]](
paste(
"Dataset {.val {sources[[i]]$dataset_name}} contains duplicate records with respect to",
"{.var {cnd$by_vars}}"
),
class = class(cnd))
cnd_muffle(cnd)
zap()
}
)
# add date variable and accompanying variables

if (create_datetime) {
date_derv <- exprs(!!date_var := as_datetime(!!source_date_var))
} else {
Expand All @@ -649,7 +736,7 @@ filter_date_sources <- function(sources,
by_vars = expr_c(subject_keys, by_vars),
order = exprs(!!date_var),
mode = mode,
check_type = "none"
check_type = check_type
)
}

Expand Down Expand Up @@ -782,6 +869,12 @@ extend_source_datasets <- function(source_datasets,
#' SRCDOM = "ADSL", SRCVAR = "DTHDT")`. The values must be a symbol, a
#' character string, a numeric value, an expression, or `NA`.
#'
#' @param order Sort order
#'
#' An optional named list returned by `exprs()` defining additional variables
#' that the source dataset is sorted on after `date`.
#'
#' *Permitted Values:* list of variables created by `exprs()` e.g. `exprs(ASEQ)`.
#'
#' @keywords source_specifications
#' @family source_specifications
Expand All @@ -793,7 +886,8 @@ tte_source <- function(dataset_name,
filter = NULL,
date,
censor = 0,
set_values_to = NULL) {
set_values_to = NULL,
order = order) {
out <- list(
dataset_name = assert_character_scalar(dataset_name),
filter = assert_filter_cond(enexpr(filter), optional = TRUE),
Expand All @@ -803,7 +897,8 @@ tte_source <- function(dataset_name,
set_values_to,
named = TRUE,
optional = TRUE
)
),
order = order
)
class(out) <- c("tte_source", "source", "list")
out
Expand Down Expand Up @@ -844,13 +939,15 @@ tte_source <- function(dataset_name,
event_source <- function(dataset_name,
filter = NULL,
date,
set_values_to = NULL) {
set_values_to = NULL,
order = NULL) {
out <- tte_source(
dataset_name = assert_character_scalar(dataset_name),
filter = !!enexpr(filter),
date = !!assert_expr(enexpr(date)),
censor = 0,
set_values_to = set_values_to
set_values_to = set_values_to,
order = order
)
class(out) <- c("event_source", class(out))
out
Expand Down Expand Up @@ -891,13 +988,15 @@ censor_source <- function(dataset_name,
filter = NULL,
date,
censor = 1,
set_values_to = NULL) {
set_values_to = NULL,
order = NULL) {
out <- tte_source(
dataset_name = assert_character_scalar(dataset_name),
filter = !!enexpr(filter),
date = !!assert_expr(enexpr(date)),
censor = assert_integer_scalar(censor, subset = "positive"),
set_values_to = set_values_to
set_values_to = set_values_to,
order = order
)
class(out) <- c("censor_source", class(out))
out
Expand Down
2 changes: 1 addition & 1 deletion R/filter_extreme.R
Original file line number Diff line number Diff line change
Expand Up @@ -106,7 +106,7 @@ filter_extreme <- function(dataset,
check_type <-
assert_character_scalar(
check_type,
values = c("none", "warning", "error"),
values = c("none", "warning", "error", "message"),
case_sensitive = FALSE
)
assert_data_frame(dataset, required_vars = by_vars)
Expand Down
1 change: 1 addition & 0 deletions inst/WORDLIST
Original file line number Diff line number Diff line change
Expand Up @@ -312,6 +312,7 @@ msec
nd
occds
onwards
param
ProfessorP-beep marked this conversation as resolved.
Show resolved Hide resolved
parttime
pharmaverse
pharmaverseadam
Expand Down
10 changes: 9 additions & 1 deletion man/censor_source.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading
Loading