Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Weekly data quality checks for mart updates #267

Open
jrlegrand opened this issue Mar 9, 2024 · 3 comments
Open

Weekly data quality checks for mart updates #267

jrlegrand opened this issue Mar 9, 2024 · 3 comments

Comments

@jrlegrand
Copy link
Member

jrlegrand commented Mar 9, 2024

Depends on #262 and to a lesser extent #266

Problem Statement

Without manual analysis, it's hard to know what changed week to week in our data marts.

Criteria for Success

NDC Description mart

  • How many new NDCs
  • How many retired NDCs
  • How many NDC->RXCUI mappings changed from previous week?
  • How many new FDA descriptions
  • How many changed FDA descriptions
  • How many retired FDA descriptions
  • How many new RxNorm descriptions
  • How many changed RxNorm descriptions
  • How many retired RxNorm descriptions

ATC to RXCUI mart

  • How may new RXCUIs
  • How many retired RXCUIs
  • How many RXCUI->ATC4 mappings changed from previous week?
  • Did anything change with ATC1-4 codes from previous week? Would expect this to only change once yearly

Additional Information

We considered adding a task to the mart DAG that saves off an old version of the file and then does a diff and creates a table with this analysis. Or something like this.

Link to historical mart flatfiles: https://drive.google.com/drive/folders/1IMotvyde-TptEOzRlCFouiWV00z4roRW?usp=sharing

@jrlegrand
Copy link
Member Author

@leemlb06pmi / @Komal77rao - I can get you the past couple of weeks of data mart flatfiles if that would be helpful for this issue - let me know when you're ready.

@leemlb06pmi
Copy link
Contributor

@leemlb06pmi / @Komal77rao - I can get you the past couple of weeks of data mart flatfiles if that would be helpful for this issue - let me know when you're ready.

ok that's great - baseline files will definitely be useful when we get to this point. thanks!

@jrlegrand
Copy link
Member Author

@jrlegrand jrlegrand removed the interns label May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

2 participants