-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate RAStock data #20
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a few comments.
@@ -953,6 +955,8 @@ def transform_building_features() -> DataFrame: | |||
# COMMAND ---------- | |||
|
|||
# DBTITLE 1,Apply upgrade functions | |||
# TODO: put this in some kind of shared config that can be used across srcipts/repos | |||
SUPPORTED_UPGRADES = [0.0, 1.0, 3.0, 4.0, 6.0, 9.0, 11.05, 13.01] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we perhaps want to have individual constants for these numbers. I guess or people who work with these every single day, they know what 3 and 7 mean, but for the rest of us, constants would be amazing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@nweires where are the enums that you and @christine-george are gonna use gonna live? Can import thsoe use those here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm still working out exactly what this will look like, but yes, there should be something better to use soon! I'm hoping to move away from using these upgrade ID numbers as the key to identify upgrades.
Draft changes are here - see upgrades.json and the schema in read_upgrades.py.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks Natalie! I will integrate these enums in a subsequent PR when that is ready to roll!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The parts related to reading RAStock data look good to me.
|
||
This function is getting revamped in a different repo so not bothering to document for now | ||
""" | ||
# TODO: update with new code from natalie/mohammad in pep |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This changed already in https://github.com/rewiringamerica/pep/pull/187, but if you want to wait for it to finish changing, this is fine for now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, i saw that but I thought I'd wait till the upgrade id table was ready and the cchp schema changes were in and then make all the changes in one go.
@@ -953,6 +955,8 @@ def transform_building_features() -> DataFrame: | |||
# COMMAND ---------- | |||
|
|||
# DBTITLE 1,Apply upgrade functions | |||
# TODO: put this in some kind of shared config that can be used across srcipts/repos | |||
SUPPORTED_UPGRADES = [0.0, 1.0, 3.0, 4.0, 6.0, 9.0, 11.05, 13.01] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm still working out exactly what this will look like, but yes, there should be something better to use soon! I'm hoping to move away from using these upgrade ID numbers as the key to identify upgrades.
Draft changes are here - see upgrades.json and the schema in read_upgrades.py.
Co-authored-by: mikivee <mikivee>
Summary
Adds selected RAStock upgrades to the surrogate model training set. This does NOT retrain the model. Ticket
Detailed Changes
util.py
andconstants.py
script in preparation for a move to dmutils. Note that some of this code will change after latest changes are merged into PEP.extract_data
to use the functions in (1) to pull in all RAStock annual outputs and combine with ResStock outputs.build_feature_store.py
datagen.py
to make sure we train on only subset of building ids, since we don't want the train on the new upgrades for right now.n_building_units
feature inbuild_feature_store.py
updated from str to int.Next
Move util and constants and tests over to dmutils and once this repo is fully migrated over, delete from here and import from there instead.
Who Reviews