Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Major rewrite #299

Open
peterdudfield opened this issue Sep 25, 2024 · 1 comment
Open

Major rewrite #299

peterdudfield opened this issue Sep 25, 2024 · 1 comment

Comments

@peterdudfield
Copy link
Collaborator

peterdudfield commented Sep 25, 2024

I thought it we are considering a major rewrite, then it might be worth say why? and what functionality we have right now.
Re writing the nwp-consumer has been a success.

Why

  • make code easier to read
  • easier to debug
  • easier to expand on
  • join together training dataset code and live consumer. Currently training dataset code is in dagster repo (TODO add link)

Requirements

  • Get 3 different satellite data, 0, 9 and 45.5.
  • For 0 and 9, get both the HRV and non-HRV data
  • Optional to scale data, to match exsisting google public dataset data
  • Dataset: Make training dataset, and merge to monthly zarrs
  • Live: Get live data
  • Live: When getting live data, save raw and process timestamp files so that we dont have to repeat this everytime the consumer runs
  • Live: Save 'latest' file, so ML models can just load that file
  • Live: If 0 degrees fails, pull 9 degree satellite
  • Live: Pull 0 degree in 5 minut intervals, pull 9 and 45.5 deg in 15 minute intervals
  • Dockerize service, so that live and dataset can use the same docker file
  • Live: Save raw data and processed data, so that we only have to pull raw data once

Optional

  • Live: For live data, with 9 degrees, use data tailor service (we might find we might not need to). If we do this, we need a way to run the data tailor clean up service too.
  • Add options for Goes satellite back in
@peterdudfield
Copy link
Collaborator Author

@devsjc could you add any other requirements you think we need

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant