In this exercise we will get "batch data" from an open data source and store into the file system:
HTTP service --> NiFi --> File
Launch Apache NiFi:
docker compose up -d nifi
This will be the data source (Valencia Open Data - ValenBisi Availability):
In that dataset, look for the CSV URL.
Some tips:
- Use the following processors
- InvokeHTTP
- PutFile
- Connect the processors with the "response" output, and terminate the others
- WARNING: Check the “SCHEDULING” when invoking the HTTP URL and set to something reasonable (e.g. 30 secs), otherwise we will saturate the server
- Apache NiFi (local): https://localhost:8443/nifi
- Apache NiFi Documentation: https://nifi.apache.org/docs.html
- Valencia Open Data: http://gobiernoabierto.valencia.es