Consider removing resampling step #13

cvitolo · 2021-04-23T08:30:15Z

Raw features are in a gridded format (either GRIB or NetCDF) and are resampled to a common spatial resolution in the pre-processing step. After resampling, features are extracted at each point of interest. The machine learning models implemented use in input a table where features and outcomes are columns and each row correspond to a point (lon and lat coordinates).

Please note the resampling step introduces errors/loss of information and it's not necessarily needed in this case. Please consider inspecting each raw feature in its original spatial resolution, to minimise information loss.

…. This addresses issue #13.

cvitolo · 2021-04-30T12:15:55Z

This issue is solved in branch: cvitolo-patch-1.
Training script will need some modification to accept new inputs.
Will close this issue when the branch is merged into master.

* Updated paths * Implemented conversion of Burned Area from m2 to hectares before the fuel load is calculated in fuelload.py. This addresses issue #10. * Added new notebook with more concise pre-processing. It includes the conversion to dataframe (model input). It solves issues #10 and #12. It also defines a new threshold for BA (50 hectares, see #11, the new threshold is defined by FDG - fire expert) but a reliable reference source is still not available. * Notebook notebooks/preprocess_all_in_one.ipynb reformatted using black * Formatted src/utils/fuelload.py using black * Minor changes to notebooks/preprocess_all_in_one.ipynb * renamed notebook and finalised concise version of the data preparation step * Complete re-write of the data pre-processing step to avoid resampling. This addresses issue #13. * Added the following amongst predictors: GFED4 basis regions (as categorical variable) and area of grid cell at point (as continuous variable). * Load formula changed to BA*CC*AGB/AREA * added log-transformed variables * updated notebooks with latest run * model 6h, MAE * experiments as in ESA-D1 report * Updated README files to clarify there are two sets of experiments (by wikilimo and ecmwf). * Update README.md

cvitolo added the enhancement New feature or request label Apr 23, 2021

cvitolo added a commit that referenced this issue Apr 30, 2021

Complete re-write of the data pre-processing step to avoid resampling…

e524f1d

…. This addresses issue #13.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider removing resampling step #13

Consider removing resampling step #13

cvitolo commented Apr 23, 2021

cvitolo commented Apr 30, 2021

Consider removing resampling step #13

Consider removing resampling step #13

Comments

cvitolo commented Apr 23, 2021

cvitolo commented Apr 30, 2021