-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
review deaths dates processing #987
Conversation
This uses the NRS Weekly dates and if this is blank use the chi death date. This methodology is wrong. We want to use the monthly nrs boxi date by default and chi date if there is an issue
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
@check-spelling-bot Report🔴 Please reviewSee the 📂 files view, the 📜action log, or 📝 job summary for details. Unrecognized words (14)
Previously acknowledged words that are now absentadpe canx ccyy cnws comhairle lcho matern mcbride ppas returnsthe smra 🫥To accept these unrecognized words as correct and remove the previously acknowledged and now absent words, you could run the following commands... in a clone of the [email protected]:Public-Health-Scotland/source-linkage-files.git repository curl -s -S -L 'https://raw.githubusercontent.com/check-spelling/check-spelling/main/apply.pl' |
perl - 'https://github.com/Public-Health-Scotland/source-linkage-files/actions/runs/10247484327/attempts/1' OR To have the bot accept them for you, reply quoting the following line: Available 📚 dictionaries could cover words (expected and unrecognized) not in the 📘 dictionaryThis includes both expected items (307) from .github/actions/spelling/expect.txt and unrecognized words (14)
Consider adding them (in with:
extra_dictionaries:
cspell:fullstack/dict/fullstack.txt
cspell:k8s/dict/k8s.txt
cspell:php/dict/php.txt
cspell:node/dict/node.txt
cspell:npm/dict/npm.txt To stop checking additional dictionaries, add (in check_extra_dictionaries: '' Errors (1)See the 📂 files view, the 📜action log, or 📝 job summary for details.
See ❌ Event descriptions for more information. If the flagged items are 🤯 false positivesIf items relate to a ...
|
Should be ready for review. @Jennit07 @SwiftySalmon |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems fine
Opening this as a draft PR for now to fix #986
Previously, the IT deaths extract was using the nrs weekly date but if this was missing, it would use the chi death date. We agreed as a team that this was wrong and the boxi nrs deaths date was the most reliable. This was the previous code in process_it_chi_deaths:
death_date = dplyr::coalesce(.data$death_date_nrs, .data$death_date_chi)
I have removed this and selected chi and death_date_chi so that we do not use the weekly nrs death date provided by IT. This is unvalidated and found to be unreliable. Instead, the IT chi death date is left with no modification and has two variables:
chi
anddeath_date_chi
.Later in the process we create a deaths lookup which is one row per chi for deaths that occurred within the financial year. This is used to match onto the episode file later in the process. By default, this was set up to use the boxi nrs death date which is submitted monthly and is validated. We agreed this should be the correct method going forward. However, this does not take into account the chi date. I have updated this in a way that:
if the BOXI NRS date does not match the chi death date, use the chi death date
here is the code which does this:When i checked this with 1920 data, there was only one case which changed from 31-03-2020 to 01-04-2020. This was not one of the chi's which had a 40year difference. The 40 year difference mainly was in the weekly nrs data.
We possibly need to review this to say
use the chi date if X time between the boxi nrs date and chi date
Another thing to check the methodology is on care home data which is explicitly using the nrs weekly data but if this is not available then use chi data. This is outstanding on this PR. The changes should be the same from the deaths lookup
TO DO:
opening a draft for now to see the changes.