
Xelera Random Forest inference engine

Xelera Random Forest inference engine provides FPGA-enabled real-time deployment for classification and regression tasks. It is fully integrated with scikit-learn and H2O DriverlessAI.

Supported Acceleration Platforms

Board              Driver                Shell                      Note
Xilinx Alveo U50   xrt_201920.2.3.1301   xilinx-u50-xdma-201920.1   available upon request
Xilinx Alveo U200  xrt_201920.2.3.1301   xilinx-u200-xdma-201830.2  available upon request
Xilinx Alveo U250  xrt_201920.2.3.1301   xilinx-u250-xdma-201830.2  provided with the docker image

Deployment modes

  • standalone: end-to-end performance benchmarks on different problem sizes with Python and scikit-learn
  • H2O DriverlessAI tuning: algorithm and pipeline tuning with the Xelera Random Forest BYOR model
  • H2O DriverlessAI deployment: standalone scoring pipeline (Python) for production with the Xelera Random Forest BYOR model

The instructions for each run configuration are given below.

Base Instructions

  1. Contact Xelera at [email protected] and request access to the Xelera Random Forest inference engine docker image
  2. Load the provided compressed docker image: docker load < Xl_rf_inference.tar.gz
  3. Place your H2O DriverlessAI license file in license/license.sig (it is not needed for deployment without H2O DriverlessAI; an evaluation license can be obtained from H2O.ai)
  4. Start the container using the provided run script: ./run_docker.sh. Note that this forwards TCP port 12345 from the docker container to the host machine. This port is required by DriverlessAI.
  5. For each sudo command inside the container, use dai as the password

Run Standalone

  1. Start the provided docker container. You will be logged in as user dai in the /app directory.
  2. Execute sudo bash run_standalone_benchmark.sh
  3. This will take some time, since the Random Forest models with large tree counts have to be trained first.
  4. The inference results are written to the file results.txt
  5. The trained models are also exported in .pkl format; they can be reloaded for inference-only tests (see the sketch after this list). To enable inference only, make the following changes to run_Xl_benchmark_single.py:
    • enable_training_CPU = False
    • enable_inference_CPU = True
    • enable_inference_FPGA = True
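
As an illustration of such an inference-only run, here is a minimal sketch that reloads an exported model and times a prediction. It assumes the .pkl files are plain pickled scikit-learn estimators; the file name is hypothetical (the actual names are produced by the benchmark script).

    # Minimal sketch of an inference-only run on an exported model.
    # Assumption: the .pkl files are plain pickled scikit-learn estimators;
    # the file name below is hypothetical.
    import pickle
    import time

    import numpy as np

    with open("rf_model_1000_trees.pkl", "rb") as f:  # hypothetical file name
        model = pickle.load(f)

    # 16 numerical features, matching the benchmark dataset
    X = np.random.rand(10000, 16).astype(np.float32)

    start = time.perf_counter()
    y_pred = model.predict(X)  # CPU (scikit-learn) inference path
    print(f"Inference on {len(X)} samples took {time.perf_counter() - start:.6f} s")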

Run H2O DriverlessAI tuning

  1. Inside the container, execute sudo bash init_h2o.sh. This starts the H2O DriverlessAI backend.
  2. In a local browser, navigate to server:12345, where server is the name or IP address of the host machine running the docker container.
  3. Log in with user h2oai and password h2oai.
  4. Go to the 'DATASETS' tab and select 'upload dataset'. Choose the file /app/temps.csv from the docker filesystem. This file is a publicly available dataset of temperature data; more information can be found in a guided example on Random Forest predictors at towardsdatascience.
  5. Go to the 'EXPERIMENTS' tab and select new experiment.
  6. Select the temps.csv data set. In the opening dialogue, select actual as the target column. If desired, give the experiment a name.
  7. In the bottom right corner, select EXPERT SETTINGS.
  8. Use the UPLOAD CUSTOM RECIPE button to select the delivered recipe xelera_byor.py from the local filesystem. You will see acceptance tests running. (A sketch of the recipe structure follows this list.)
  9. In the 'RECIPES' tab, open 'include specific models' and select only XELERA RF FPGA INFERENCE.
  10. In the 'SYSTEM' tab, set 'Number of cores to use = 1' and 'Maximum number of cores to use for model predict = 1'. This prevents multiple processes from accessing the FPGA at the same time.
  11. Leave the expert settings and launch the experiment.
  12. You can follow the progress in the central status bar. Wait until the experiment has finished.
  13. Detailed timing information can be found inside the docker container in the DriverlessAI log file /opt/h2oai/dai/log/dai.log.
  14. After the experiment has finished, you can download the Python scoring pipeline for deployment using the 'DOWNLOAD PYTHON SCORING PIPELINE' button. The resulting deployment pipeline will be downloaded to the browser's local file system.
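
For orientation, the following is a hedged skeleton of what a DriverlessAI BYOR model recipe such as xelera_byor.py typically looks like. It is based on the structure of H2O's public custom recipe templates, not on the shipped Xelera recipe; the class name and the CPU-only fit/predict bodies are illustrative assumptions.

    # Hedged skeleton of a DriverlessAI BYOR model recipe. The structure
    # (CustomModel subclass with fit/predict) follows H2O's public custom
    # recipe conventions; the class name and CPU-only bodies are
    # illustrative assumptions, not the shipped xelera_byor.py.
    from h2oaicore.models import CustomModel  # provided inside DriverlessAI

    class XeleraRFExample(CustomModel):  # hypothetical class name
        _regression = True
        _binary = True
        _multiclass = True
        _display_name = "XELERA RF FPGA INFERENCE"

        def fit(self, X, y, sample_weight=None, eval_set=None,
                sample_weight_eval_set=None, **kwargs):
            # Train the forest on the CPU (scikit-learn used here for
            # illustration) and register the model with DriverlessAI.
            from sklearn.ensemble import RandomForestClassifier
            model = RandomForestClassifier(n_estimators=100, max_depth=8)
            model.fit(X.to_numpy(), y)
            self.set_model_properties(model=model,
                                      features=list(X.names),
                                      importances=model.feature_importances_.tolist(),
                                      iterations=model.n_estimators)

        def predict(self, X, **kwargs):
            # The real recipe would offload tree traversal to the FPGA here.
            model, _, _, _ = self.get_model_properties()
            return model.predict(X.to_numpy())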

Run H2O DriverlessAI deployment

Reference: Driverless AI Documentation on Python Scoring Pipeline

  1. As a prerequisite for this run, you must have completed the H2O DriverlessAI tuning and downloaded the 'Python Scoring Pipeline'.
  2. Extract the downloaded .zip file from the previous step and copy the resulting directory into the base directory of the docker container. If the directory is not named scoring-pipeline, rename it; this is required to mount the directory correctly.
  3. Inside the container (/app directory), navigate to the scoring-pipeline directory using cd scoring-pipeline
  4. Run the deployment pipeline example (provided by H2O) using sudo ./run_example.sh
  5. The script will install multiple dependencies in a virtual environment.
  6. At the end, the script runs the python scoring pipeline. You can see the printed messages from the custom recipe, indicating the runtimes and problem sizes. The number of trees is determined by the DriverlessAI training; the number of samples is hard-coded in the example script. (A sketch of the scoring call follows this list.)
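
For reference, the example script drives the generated pipeline roughly as sketched below, following H2O's documented Python scoring pipeline pattern. The experiment-specific module name and the feature values are hypothetical.

    # Hedged sketch of how the generated scoring pipeline is driven by the
    # example script, following H2O's documented Python scoring pipeline
    # pattern. The module name and feature values are hypothetical.
    from scoring_h2oai_experiment_abc123 import Scorer  # hypothetical experiment id

    scorer = Scorer()

    # One input row with the dataset's feature columns, in training order;
    # the values below are placeholders, not real temps.csv records.
    row = [2016, 12, 1, 45.0, 44.0, 45.6]
    print(scorer.score(row))  # prints the model's prediction for this row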

Standalone Benchmark results

Hardware Setup

  • Server Dell PowerEdge R740
    • CPU: Intel(R) Xeon(R) Gold 5118 (32 cores) @ 2.30GHz
    • RAM: 256GB DDR4 @ 2666 MHz
  • Accelerator platform:
    • Xilinx Alveo U250

Problem Setup

  • Training:
    • CPU (Scikit)
  • Inference:
    • FPGA (Xelera)
    • CPU (Scikit)
  • Dataset: weather -> predict the temperature
    • Features: 16 numerical features
    • Label: 54 classes
  • Tree depth: max. 8 levels
  • Forest size: 100, 1000, 10000, 100000 trees
  • Sample batch size: 1, 10, 100, 1000, 10000, 100000 samples

Goal

Measure the end-to-end latency (Python application) of Random Forest classifier inference.
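
The measurement methodology can be pictured with the following minimal sketch (this is not the actual run_Xl_benchmark_single.py script): a forest is trained on the CPU, then predict() is timed from Python for each batch size.

    # Illustrative sketch of the end-to-end latency measurement; this is
    # not the actual benchmark script. Parameters mirror the problem setup
    # above (16 features, 54 classes, max tree depth 8).
    import time

    import numpy as np
    from sklearn.ensemble import RandomForestClassifier

    rng = np.random.default_rng(0)
    X_train = rng.random((10000, 16), dtype=np.float32)  # 16 numerical features
    y_train = rng.integers(0, 54, size=10000)            # 54 classes

    for n_trees in (100, 1000):                          # subset of the forest sizes
        model = RandomForestClassifier(n_estimators=n_trees, max_depth=8, n_jobs=-1)
        model.fit(X_train, y_train)
        for batch in (1, 10, 100, 1000):                 # subset of the batch sizes
            X = rng.random((batch, 16), dtype=np.float32)
            start = time.perf_counter()
            model.predict(X)                             # timed end-to-end from Python
            print(f"{n_trees} trees, batch {batch}: {time.perf_counter() - start:.6f} s")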

Results

CPU-based: Intel Xeon 5118 (32 cores), end-to-end latency [s]

Sample batch size \ Number of trees        100        1000       10000      100000
1                                     0.113754    0.558969    5.682151  60.4001916
10                                    0.112719    0.615108    5.674495  65.8061304
100                                   0.113577    0.717954    6.465377  65.3164281
1000                                  0.110944    0.818687    7.682359  77.2385877
10000                                 0.215964    1.42355    13.90265   136.317463
100000                                1.354374   11.20106   112.6743    1129.38227

FPGA-based: Xilinx Alveo U250, end-to-end latency [s]

Sample batch size \ Number of trees        100        1000       10000      100000
1                                     0.000523    0.000575    0.001169    0.003734
10                                    0.000615    0.000583    0.001381    0.004252
100                                   0.000980    0.000947    0.002377    0.010962
1000                                  0.004594    0.004657    0.011452    0.055461
10000                                 0.029339    0.040394    0.090505    0.424453
100000                                0.214748    0.304464    0.805108    4.161537
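
Comparing the two tables, the measured FPGA speedup over the CPU baseline ranges from roughly 6x (100 trees, 100000 samples) to more than 16,000x (100000 trees, single-sample batches).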
