Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Track gap filling / imputed values #80

Open
wolski opened this issue Jun 10, 2024 · 0 comments
Open

Track gap filling / imputed values #80

wolski opened this issue Jun 10, 2024 · 0 comments

Comments

@wolski
Copy link
Collaborator

wolski commented Jun 10, 2024

This use case comes from Metabolomics/Compound discoverer, but will also open the option for modeling imputed data.

  • CD reports in the Gap Status column how compound abundance was generated (e.g., Gap Fill - imputation, I guess).
  • This is reported per compound and sample.

The idea is to keep track of this information in the LFQData object,

This can be implemented as follows :

  • in addition to the response, use a gap_fill_status / nr_children column to track whether the response is observed or imputed.

Implementation:

  • use just introduced nr_children column - if Number children 0 => imputed.
  • Currently, we compute statistics about observed and missing values using the complete_cases method and counting NAs.
    Options:
  • If gap_fill_status column is present, use an alternative implementation. Otherwise, use existing.
  • Or always present, then use nr_children_column and dichotomize on 0 and not 0.

For the moment, I will go with two versions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant