Xelera Random Forest inference engine provides FPGA-enabled real-time deployment for Classification and Regression tasks. It is fully integrated into scikit and H2O driverless AI.
Board | Driver | Shell | Note |
---|---|---|---|
Xilinx Alveo U50 | xrt_201920.2.3.1301 | xilinx-u50-xdma-201920.1 | available upon request |
Xilinx Alveo U200 | xrt_201920.2.3.1301 | xilinx-u200-xdma-201830.2 | available upon request |
Xilinx Alveo U250 | xrt_201920.2.3.1301 | xilinx-u250-xdma-201830.2 | provided with the docker image |
- standalone: end-to-end performance benchmarks on different problem sizes with python and Scikit
- H2O DriverlessAI tuning: algorithm and pipeline tuning with Xelera Random Forest BYOR model
- H2O DriverlessAI deployment: standalone scoring pipeline (Python) for production with Xelera Random Forest BYOR model
The instructions for each run configuration are given below. s
- Contact Xelera at [email protected] and request access to Xelera Random Forest inference engine docker image
- Load (
docker load < Xl_rf_inference.tar.gz
) the provided compressed docker image - Place your H2O DriverlessAI license file in
license/license.sig
(it is not needed for the deployment without H2O DriverlessAI. You can get an evaluation version here) - Start the container using the provided run script:
./run_docker.sh
. Note that this forwards TCP port 12345 from the docker container to the host machine. This port is required by DriverlessAI. - For each sudo command inside the container, use
dai
as password
- Start the provided docker container. You will be logged in as user
dai
in the/app
directory. - Execute
sudo bash run_standalone_benchmark.sh
- It will take some time, since the Random Forest models with large tree amounts have to be trained first.
- The inference results will be put into the file
results.txt
- The trained models will also be exported in
.pkl
format. They can be reloaded for inference only tests. Make the following changes torun_Xl_benchmark_single.py
to enable only inference:enable_training_CPU = False
enable_inference_CPU = True
enable_inference_FPGA = True
- Inside the container, execute
sudo bash init_h2o.sh
. This starts the H2O DriverlessAI backend - In the local browser, navigate to
server:12345
using the address bar.server
is the name or IP address of the host machine running the docker container. - Log in using user:
h2oai
, password:h2oai
- Go to the 'DATASETS' tab and select 'upload dataset'. Choose the file
/app/temps.csv
from the docker filesystem. This file is a dataset of temperature data publicly available at File. More information regarding this file can be found at towardsdatascience in a guided example to Random Forest Predictors. - Go to the 'EXPERIMENTS' tab and select
new experiment
- Select the
temps.csv
data set. In the opening dialogue, selectactual
as target column. If desired, give the experiment a name. - In the bottom right corner, select
EXPERT SETTINGS
- Use the
UPLOAD CUSTOM RECIPE
button to select the delivered recipexelera_byor.py
from the local filesystem. You will see acceptance tests running. - In the 'RECIPES' tab, select values for 'include specific models'. Only select the
XELERA RF FPGA INFERENCE
- In the 'SYSTEM' tab, set the following: 'Number of cores to use = 1' and 'Maximum number of cores to use for model predict = 1". This prohibits multiple processes from accessing the FPGA.
- Leave the expert settings and launch the experiment.
- You can see the progress in the central status bar. Wait until finished.
- Detailed timing information can be found inside the docker in the DriverlessAI log file:
/opt/h2oai/dai/log/dai.log
. - After the experiment has finished, you can download the python scoring pipeline for the deployment using the 'DOWNLOAD PYTHON SCORING PIPELINE' button. The resulting deployment pipeline will be download to the browser local file system.
Reference: Driverless AI Documentation on Python Scoring Pipeline
- As a requirement for this run, you must have completed the H2O DriverlessAI tuning and downloaded the 'Python Scoring Pipeline'
- Extract the downloaded .zip file from the previous step into the directory of the docker container. If not named
scoring-pipeline
, rename the directory to that name. This is required to mount the directory correctly. Copy thescoring-pipeline
directory into the base directory of the docker container. - Inside the container (/app directory), navigate to the scoring-pipeline directory using
cd scoring-pipeline
- Run the deployment pipeline example (provided by H2O) using
sudo ./run_example.sh
- The script will install multiple dependencies in a virtual environment. Compare the )
- In the end, the script will run the python scoring pipeline. You can see the printed messages from the custom recipe, indicating the runtimes and problem sizes. The amount of trees is determined by the DriverlessAI training; the amount of samples is coded in the example script.
- Server Dell PowerEdge R740
- CPU: Intel(R) Xeon(R) Gold 5118 (32 cores) @ 2.30GHz
- RAM: 256GB DDR4 @ 2666 MHz
- Accelerator platform:
- Xilinx Alveo U250
- Training:
- CPU (Scikit)
- Inference:
- FPGA (Xelera)
- CPU (Scikit)
- Dataset: weather -> predict the temperature
- Features: 16 numerical features
- Trees: 8 levels max
- Label: 54 classes
- Forest size: 100, 1000, 10000, 100000 trees
- Samples batch size: 1, 10, 100, 1000, 10000, 100000 samples
Measure RF inference Classifier end-to-end (python application) latency
CPU-based: Intel Xeon 5118 (32 cores) end-to-end latency [s]
Sample batch size \ Number of trees | 100 | 1000 | 10000 | 100000 |
---|---|---|---|---|
1 | 0.113754 | 0.558969 | 5.682151 | 60.4001916 |
10 | 0.112719 | 0.615108 | 5.674495 | 65.8061304 |
100 | 0.113577 | 0.717954 | 6.465377 | 65.3164281 |
1000 | 0.110944 | 0.818687 | 7.682359 | 77.2385877 |
10000 | 0.215964 | 1.42355 | 13.90265 | 136.317463 |
100000 | 1.354374 | 11.20106 | 112.6743 | 1129.38227 |
FPGA-based: Xilinx Alveo U250 end-to-end latency [s]
Sample batch size \ Number of trees | 100 | 1000 | 10000 | 100000 |
---|---|---|---|---|
1 | 0.000523 | 0.000575 | 0.001169 | 0.003734 |
10 | 0.000615 | 0.000583 | 0.001381 | 0.004252 |
100 | 0.000980 | 0.000947 | 0.002377 | 0.010962 |
1000 | 0.004594 | 0.004657 | 0.011452 | 0.055461 |
10000 | 0.029339 | 0.040394 | 0.090505 | 0.424453 |
100000 | 0.214748 | 0.304464 | 0.805108 | 4.161537 |