Bad data for Solano county cases on 2020-09-02 #174

Open
Mr0grog opened this issue Jan 21, 2021 · 2 comments
Labels
bug, source: arcgis

Comments

@Mr0grog
Collaborator

Mr0grog commented Jan 21, 2021

The timeseries for Solano county cases has a huge, erroneous bump in the data on September 2, 2020:

{
  "cases": [
    // ...
    {
      "date": "2020-09-01",
      "cumul_cases": 5547
    },
    {
      "date": "2020-09-02",
      "cumul_cases": 25806
    },
    {
      "date": "2020-09-10",
      "cumul_cases": 5778
    },
    // ...
  ]
}

This is coming from the underlying data source (https://www.arcgis.com/home/webmap/viewer.html?url=https://services2.arcgis.com/SCn6czzcqKAFwdGU/ArcGIS/rest/services/COVID19Surveypt1v3_view/FeatureServer&source=sd):

[Image: Screen Shot 2021-01-21 at 2 44 40 PM]

We should figure out a way to smooth this out, whether by hardcoding a manual correction for that date, pulling data from the state, or something else.
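
For spotting bumps like this automatically, here is a minimal sketch. It assumes cumulative counts should never decrease, so any entry larger than a value on a later date is suspect. The function name `find_cumulative_spikes` is hypothetical and not part of the existing scraper, and a legitimate downward revision from the county would also get flagged, so the results would need a manual look.

```python
from typing import Any, Dict, List


def find_cumulative_spikes(
    entries: List[Dict[str, Any]], key: str
) -> List[Dict[str, Any]]:
    """Return entries whose cumulative value exceeds some later entry's value.

    Assumes `entries` is sorted by date, oldest first.
    """
    spikes = []
    lowest_later = float("inf")
    # Walk backwards so we always know the smallest value seen on later dates.
    for entry in reversed(entries):
        value = entry[key]
        if value > lowest_later:
            # A cumulative count cannot be higher than a later cumulative count.
            spikes.append(entry)
        else:
            lowest_later = value
    spikes.reverse()  # Restore oldest-first order.
    return spikes


# Values copied from the timeseries excerpt in this issue.
cases = [
    {"date": "2020-09-01", "cumul_cases": 5547},
    {"date": "2020-09-02", "cumul_cases": 25806},
    {"date": "2020-09-10", "cumul_cases": 5778},
]
print(find_cumulative_spikes(cases, "cumul_cases"))
# → [{'date': '2020-09-02', 'cumul_cases': 25806}]
```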

@Mr0grog
Collaborator Author

Mr0grog commented Jan 21, 2021

Looks like we have this problem for deaths and tests, too:

{
  "tests": [
    // ...
    {
      "date": "2020-09-01",
      "cumul_tests": 75559
    },
    {
      "date": "2020-09-02",
      "cumul_tests": 188850
    },
    {
      "date": "2020-09-10",
      "cumul_tests": 79112
    },
    // ...
  ],
  "deaths": [
    // ...
    {
      "date": "2020-09-01",
      "cumul_deaths": 47
    },
    {
      "date": "2020-09-02",
      "cumul_deaths": 106
    },
    {
      "date": "2020-09-10",
      "cumul_deaths": 48
    },
    // ...
  ]
}
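
Since all three series are bad on the same date, a hardcoded correction could be a single table of dates to drop per series. This is only a sketch: `KNOWN_BAD_DATES` and `drop_known_bad_dates` are hypothetical names, and dropping the entry (rather than replacing it with a corrected value) is just one option.

```python
from typing import Any, Dict, List, Set

# Hypothetical table of manually identified bad dates, keyed by series name.
KNOWN_BAD_DATES: Dict[str, Set[str]] = {
    "cases": {"2020-09-02"},
    "tests": {"2020-09-02"},
    "deaths": {"2020-09-02"},
}


def drop_known_bad_dates(
    timeseries: Dict[str, List[Dict[str, Any]]]
) -> Dict[str, List[Dict[str, Any]]]:
    """Return a copy of `timeseries` without entries on known-bad dates."""
    return {
        name: [
            entry
            for entry in entries
            if entry["date"] not in KNOWN_BAD_DATES.get(name, set())
        ]
        for name, entries in timeseries.items()
    }


# Values copied from the excerpts in this issue.
timeseries = {
    "tests": [
        {"date": "2020-09-01", "cumul_tests": 75559},
        {"date": "2020-09-02", "cumul_tests": 188850},
        {"date": "2020-09-10", "cumul_tests": 79112},
    ],
    "deaths": [
        {"date": "2020-09-01", "cumul_deaths": 47},
        {"date": "2020-09-02", "cumul_deaths": 106},
        {"date": "2020-09-10", "cumul_deaths": 48},
    ],
}
cleaned = drop_known_bad_dates(timeseries)
print([entry["date"] for entry in cleaned["tests"]])
```

The downside of a hardcoded table is that it has to be maintained by hand if the source ever corrects or re-breaks the data.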

@Mr0grog
Collaborator Author

Mr0grog commented Jan 21, 2021

And this isn't showing up in their dashboard, which is based off this data:

[Image: Screen Shot 2021-01-21 at 2 52 51 PM]

It looks like it’s reading a different field than we are:

  • Chart on dashboard: Number of specimens collected on this date
  • Us: Cumulative number of cases on the date reported to Solano public health

So we should maybe just be looking at different fields. That doesn’t solve the issues for deaths and tests, though. ¯\_(ツ)_/¯

Mr0grog added the bug and source: arcgis labels Jan 21, 2021