#feature: Create a Generalized Linkage Script #22

Open
J-glove opened this issue Aug 28, 2024 · 11 comments
Assignees
Labels
feature new code/logic to be written library has to do with the library portion of this project

Comments

@J-glove
Collaborator

J-glove commented Aug 28, 2024

@chengrong-us, using the script you wrote, "linkage_ucr_sql.py", as a guide, develop a linkage script that does not depend on the exposome data.

For now - it does NOT need to read from SQL.

You can write it to read the data from a file.

@J-glove J-glove added feature new code/logic to be written library has to do with the library portion of this project labels Aug 28, 2024
@J-glove J-glove changed the title #feature #feature: Create a Generalized Linkage Script Aug 28, 2024
@chengrong-us
Collaborator

@J-glove I've successfully completed building the scripts, and they are running smoothly for UCR and CACES. However, we might have to double-check the accuracy of the results and verify them with the remaining exposome datasets.

@J-glove
Collaborator Author

J-glove commented Sep 11, 2024

@chengrong-us, please push the script upstream with a PR.

@J-glove
Collaborator Author

J-glove commented Sep 11, 2024

No need to push upstream. @chengrong-us still needs to verify accuracy and execution of remaining exposome datasets.

Push upstream once results are verified.

Suggestion: Make a script to run each dataset against the linkage script and save the result.

@allfires will provide location of linkage ready datasets on the exposome server to @chengrong-us.
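
The suggested driver script could be sketched roughly as follows. This is only an illustration under stated assumptions: `run_all`, `link_fn`, the `<name>_linked.csv` output naming, and the CSV input format are all hypothetical and do not come from the actual linkage script. The idea is to run every dataset through the same linkage entry point, save one result file per dataset, and record whether each run succeeded, failed, or returned nothing.

```python
import csv
from pathlib import Path
from typing import Callable, Dict, List

Row = Dict[str, str]


def run_all(dataset_paths: Dict[str, Path],
            link_fn: Callable[[str, List[Row]], List[Row]],
            out_dir: Path) -> Dict[str, str]:
    """Run link_fn on each dataset file and save one CSV per dataset.

    link_fn is a placeholder for the real linkage entry point (hypothetical
    signature: dataset name and input rows in, linked rows out).
    Returns a status per dataset: 'ok', 'empty', or 'error: ...'.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    status: Dict[str, str] = {}
    for name, path in dataset_paths.items():
        try:
            with open(path, newline="") as f:
                rows = list(csv.DictReader(f))
            linked = link_fn(name, rows)
        except Exception as exc:
            # Keep going so one failing dataset does not block the others.
            status[name] = f"error: {exc}"
            continue
        if not linked:
            # Flag runs that complete but produce no linked rows.
            status[name] = "empty"
            continue
        with open(out_dir / f"{name}_linked.csv", "w", newline="") as f:
            writer = csv.DictWriter(f, fieldnames=list(linked[0].keys()))
            writer.writeheader()
            writer.writerows(linked)
        status[name] = "ok"
    return status
```

Reporting `empty` separately from `error` is a deliberate choice in this sketch: a run that finishes without raising but links nothing is easy to miss otherwise.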

@chengrong-us
Collaborator

  1. FARA and NATA:
    • The linkage for FARA and NATA has been verified and is working as expected.
  2. HUD:
    • The HUD linkage step runs without errors; however, it is currently yielding no results. This issue may be related to the calculation of days by year and quarter.
  3. WI Preprocessing Issue:
    • There seems to be an issue with the WI preprocessing. The dataset might not include data for Florida (FL).
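
For the HUD item, one way to sanity-check a days-by-year-and-quarter calculation is to compare it against a small reference implementation. The sketch below is not the logic in the linkage script (which is not shown in this thread); `quarter_bounds` and `days_per_quarter` are hypothetical helper names, and the interval is assumed to be inclusive on both ends.

```python
from datetime import date, timedelta
from typing import Dict, Tuple


def quarter_bounds(year: int, quarter: int) -> Tuple[date, date]:
    """Return the first and last calendar day of a quarter (quarter is 1..4)."""
    start = date(year, 3 * (quarter - 1) + 1, 1)
    if quarter == 4:
        next_start = date(year + 1, 1, 1)
    else:
        next_start = date(year, 3 * quarter + 1, 1)
    return start, next_start - timedelta(days=1)


def days_per_quarter(start: date, end: date) -> Dict[Tuple[int, int], int]:
    """Count the days of the inclusive interval [start, end] in each (year, quarter)."""
    result: Dict[Tuple[int, int], int] = {}
    if end < start:
        return result  # an inverted interval contributes no days
    year, quarter = start.year, (start.month - 1) // 3 + 1
    while True:
        q_start, q_end = quarter_bounds(year, quarter)
        if q_start > end:
            break
        # Overlap of [start, end] with this quarter, counting both endpoints.
        overlap = (min(end, q_end) - max(start, q_start)).days + 1
        if overlap > 0:
            result[(year, quarter)] = overlap
        year, quarter = (year, quarter + 1) if quarter < 4 else (year + 1, 1)
    return result
```

If the per-quarter counts for a record do not sum to the length of its date interval, or come back empty for a valid interval, that would point to the days calculation as the reason no rows are linked.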

@J-glove
Collaborator Author

J-glove commented Sep 18, 2024

Because of the issues Chengrong described, we will move on to those bug tickets and move this ticket to the backlog.

@chengrong-us
Collaborator

@J-glove Fixed HUD bugs.
ACS, ACAG, CBP/ZBP, WI still need to be verified.

@J-glove
Collaborator Author

J-glove commented Oct 2, 2024

@chengrong-us, now that the walkability index script is corrected, please attempt to run this script on WI. This is more important than the initial driver script. (#32)

@allfires, please regenerate the new WALKABILITY file for Chengrong to test, and notify him when it is finished.

@chengrong-us
Collaborator

@J-glove there are still issues in WI preprocessing. There seems to be an inconsistency between the GEOID10 in the formatted dataset (e.g., '120910000000') and the GEOID10 in the buffer (e.g., '120299701021'). The GEOID in the formatted dataset might be county level or truncated instead of block group level.
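
A quick way to confirm this kind of mismatch is to classify each GEOID10 by its digit layout. In census GEOIDs the state is 2 digits, the county 3, the tract 6, and the block group 1, so a block group identifier has 12 digits. The sketch below is a heuristic only: `classify_geoid10` and the `county_padded` label are hypothetical names, and treating an all-zero tract and block group portion as a padded county code is an assumption, not a documented rule.

```python
def classify_geoid10(geoid: str) -> str:
    """Guess the geographic level of a GEOID string from its length and digits."""
    g = geoid.strip()
    if not g.isdigit():
        return "invalid"
    levels = {2: "state", 5: "county", 11: "tract", 12: "block_group", 15: "block"}
    level = levels.get(len(g), "unknown_length")
    # Heuristic (assumption): 12 digits whose tract and block group portion
    # is all zeros looks like a 5-digit county code padded on the right.
    if level == "block_group" and g[5:] == "0" * 7:
        return "county_padded"
    return level


def match_rate(formatted_ids, buffer_ids) -> float:
    """Share of distinct buffer GEOIDs that also appear in the formatted dataset."""
    buffer_set = set(buffer_ids)
    if not buffer_set:
        return 0.0
    return len(buffer_set & set(formatted_ids)) / len(buffer_set)
```

Applied to the two examples above, the formatted value would be flagged as `county_padded` and the buffer value as `block_group`, which is consistent with the county-level or truncated explanation.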

@J-glove
Collaborator Author

J-glove commented Oct 8, 2024

It seems that this ticket is now blocked by issues in the preprocessing pipeline. Walkability has issues per @chengrong-us, and the ACAG formatter is not made yet as of 10/08/24. This ticket will be moved to backlog (not ready).

@chengrong-us
Collaborator

@J-glove the preprocessing and linkage scripts work well against WI.
Update: ACS, ACAG, CBP/ZBP still need to be verified.

@J-glove
Collaborator Author

J-glove commented Oct 30, 2024

Per @chengrong-us, the verification of this script is blocked by development of the ACAG and CBP datasets.

2 participants