Skip to content

Atlas-B2023/ResidentialElectrificationTracker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

97 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DISCLAIMER

Utilizing Redfin's data for commercial use is prohibited. MLS data may be protected by copyright laws. Program authors do not take responsibility for any action Redfin or its MLS brokers take against users of this software. Redfin data is not prepackaged in this repository.

(Not sure if we need a license file? Consider as source available, and free to modify, and not free to sell)

Installation Directions

  1. Download Python 3.12 from the official website (the big button usually works).
  2. Download this repository, either by cloning or by downloading the ZIP file.
  3. Run the included setup.bat.
  4. Thats it!

The installation steps are covered in the YouTube video series on the documentation page as well.

How to Run the program

In order to run this program, two API keys are required. The instructions on how to get these are listed in the Energy and Census sections.

Once these have been acquired, create a file named .env in the src\ folder.

In this file, write CENSUS_API_KEY= and EIA_API_KEY= on the first two lines, and after the equals sign paste in your key for each entry.

To start the application, double click on the run.bat file. This should start the application, and now you can use it!

Paid APIS

There are many paid APIs allow commercial use.

Energy

Energy prices are through the EIA's open data API.

Get your API key here: https://www.eia.gov/opendata/register.php

https://www.eia.gov/opendata/pdf/EIA-APIv2-HandsOn-Webinar-11-Jan-23.pdf https://www.eia.gov/opendata/?category=0 https://www.eia.gov/opendata/documentation.php

Census

Census data is collected through their API. We use the ACS 5 year profile and subject tables DP05 and S1901, respectively.

If specified criteria are met, a metropolitan statistical area containing a single core with a population of 2.5 million or more may be subdivided to form smaller groupings of counties referred to as "metropolitan divisions."

Get your API key here: https://api.census.gov/data/key_signup.html

https://www.census.gov/programs-surveys/metro-micro/about/glossary.html

https://www.nber.org/research/data/census-core-based-statistical-area-cbsa-federal-information-processing-series-fips-county-crosswalk

Misc

Each request from Redfin takes about 4.5 seconds to GET and analyze. This is intentional so as to not flood Redfin's servers.

Output

Output is organized as such:

output/
├── metro_data/
│   ├── <Metro_name>/
│       ├── <zip>.csv
│   ├── <Other_metro_name>/
│       ├── <otherzip>.csv
├── census_data/
│   ├── cache/
│       ├── <chache>.json
│   ├── <acs5>.csv
├── logging/
│     ├── logging.log

Warning

If you are running metros that share zip codes, the same zip code will be imported twice. This is an improvement that can be done by checking what zip codes have been loaded in the sister repository.

Sources

Census data:

DP05 and S1901 tables from the American Community Survey 5 year, vintage 2019, https://www.census.gov/data/developers/data-sets/acs-5year.html.

Get your API key from: https://api.census.gov/data/key_signup.html.

Energy price data:

Energy Information Administration, https://www.eia.gov/opendata/.

Get your API key from the right hand side by clicking the register button

Augmenting data:

This folder contains a collection of files that help with geo location information.

Housing data:

Redfin's stingray API

Improvements

Create a fully python app. Plotly has native python support for all things spacial analysis. https://plotly.com/python/mapbox-county-choropleth/. Additionally, using Dash https://dash.plotly.com/ might make the experience more seamless

Add tool bar for graphs. Use mplcursors for hover on the graph.

Use census library from python: table prefixes for DP and S tables are acs5dp and acs5st in the code. This relies on a library that has not been updated to python 3.12, so we didn't use it.

Use database to store information, and implement caching/purging (if TOS allows)

The State Energy Data System might provide a more complete dataset for energy price data. It is recommended to switch to this API for calculating heating costs. Note that this data is about 1 year behind as it is annual and takes additional time to process. The direct download link for preliminary 2022 data (this means only up to 2021) is here. The navigate here from the main page, click "Data Files" and navigate to "Prices and expenditures" > "All price and expenditure estimates" > "Prices" > select download format

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •