- Installation
- Project Motivation
- File Descriptions
- Licensing, Authors, and Acknowledgements
- View the notebook
- BeautifulSoup library for web scraping. Please see docs and install instructions here: https://www.crummy.com/software/BeautifulSoup/bs4/doc/
Other: standard Python libraries across Anaconda distribution.
The code should run with no issues using Python versions 3.*.
I was interested in examining the composition of FAA component repair stations in the United States.
Here are the questions I considered through the analysis:
- How many FAA repair stations are in each U.S. state?
- For the states with the most repair stations, what are their FAA ratings (specializations)?
- What is the composition of small businesses (< 20 employees) to large businesses across the repair station landscape?
- What percentage of the repair station workforce is skilled (certificated), and how does composition vary from small to large businesses?
Files include a single .ipynb and three supporting .csv files.
.ipynb: Contains all code and visualizations. The .ipynb flows linearly from data collection, cleaning & processing, to modeling and visualizations.
.csv files: Source data (initial_export_faa.csv); webscrape results (scraped_stations.csv); and final consolidated data set after collecting and scraping (all_data_faa.csv) - please use this data set for your own projects!
The source data for this project originated from the FAA's website: https://av-info.faa.gov/repairstation.asp The initial export from the site - for this project - was done on 05/24/2020. The data is updated weekly by the FAA, and may vary by the time you are exploring the files.
All code is open for any and all usage.
View the notebook at Jupyter's NBviewer site, click here.