This repo contains dockerfile and script to build/pull, run docker images for cross-compilation on a powerful desktop/laptop, or directly run on Jetson hardware platforms out-of-the-box.
-
Prerequisites:
-
On a Jetson Device (Currently only supports Xavier NX flashed with R32.7)
-
Have CUDA installed locally
-
To check:
sudo apt list --installed | egrep -i "nvidia-cuda|nvidia-l4t-cuda" ls -lah /usr/local/cuda-10.2/
If both packages appear and the
ls
shows reasonable outputs then CUDA is locally installed This step is necessary since we will directly mount the/usr/local/cuda-10.2/
directory onto the container
-
-
Have nvidia-docker installed
-
To check:
sudo apt list --installed | egrep -i "nvidia-docker2"
If the package appears then it's installed
-
-
-
To build:
./scripts/build.sh
-
To pull:
./scripts/pull.sh
-
To run:
./scripts/run.sh
- Feel free to modify the default
run.sh
to append additional parameters/mounts - You can try to mount the
/opt/nvidia/vpi1
directory onto the container since the dev & libs of VPI 1 are all in this directory, but it's not necessary since the dockerfile already contains the directory- To find out the dependencies of
nvidia-vpi
, you can executesudo apt update && sudo apt show nvidia-vpi
on a Jetson device; to find out the files packages (e.g. VPI 1 here) contains, you can executesudo dpkg -L nvidia-vpi vpi1-dev libnvvpi1
on a Jetson device
- To find out the dependencies of
- Feel free to modify the default
- This repository uses
docker buildx
to (cross-)compile docker image, check your available target architecture by executingdocker buildx ls
in a shell terminal- If it doesn't support the desired architecture, install emulator from binfmt by executing
docker run --privileged --rm tonistiigi/binfmt --install all
in a shell terminal - With docker image cross-compile enabled, you can build the non-GPU part of this docker image on your x86_64 desktop/laptop
- If it doesn't support the desired architecture, install emulator from binfmt by executing
- The base is from Nvidia NGC L4T ML container images
- Supports JetPack > 4.4, i.e. L4T > R32.4.2
- Contains PyTorch, TensorFlow, onnx, CUDA, cuDNN, etc. Check the website Overview for more info on distribution-specific libraries.
- The dockerfile in this repo further extends the libraries by installing the following libraries for robotics applications:
- Torch-TensorRT (depends on cuDNN, CUDA, and CUDA Arch BIN version)
- VPI
- Ceres Solver
- oh-my-zsh (for devel purpose)
- For more hardware specific installs like VPI, please login to your Jetson hardware, and then:
- Execute
sudo apt update
orcat /etc/apt/sources.list.d/nvidia-l4t-apt-source.list
in a shell terminal, find the sources contain repo.download.nvidia.com, and modify the lines in the dockerfile under docker folder that adds the apt repo- Check Jetson apt repo for more details
- Basically, the 1st level is the Jetpack version, and the 2nd level contains hardware-specific distributions and a
common
for all hardware. e.g., for Xavier NX installed with L4T R32.7.3, it'll go toJetpack 4.6.x
, thent194
, which contains*nvidia-jetpack*.deb
(the ensemble package of JetPack SDK); forx86_64
development on desktop/laptop platforms, it'll go toJetpack 4.6.x
, thenx86_64/bionic
orx86_64/xenial
depending on your Ubuntu distribution
- Basically, the 1st level is the Jetpack version, and the 2nd level contains hardware-specific distributions and a
- Check Jetson apt repo for more details
- Install jtop if you haven't already, then execute
jtop
in a shell terminal, clickinfo
tab on the bottom to check versions of all the installed libraries and supported hardware bin, important ones include:- CUDA and CUDA Arch BIN
- L4T and Jetpack
- cuDNN
- TensorRT
- VPI
- OpenCV
- Execute
-
If you encountered the following error:
unknown flag: --platform
Then you need to install
buildx
plugin bysudo apt install docker-buildx-plugin
If you do not see
docker-buildx-plugin
available or it doesn't solve the previous problem, please follow the official guide to install the completedocker engine
-
Building Torch-TensorRT from source
-
On Jetson platforms, NVIDIA hosts pre-built Pytorch wheel files. These wheel files are built with CXX11 ABI. You'll also notice that there're
Pre CXX11 ABI
andCXX11 ABI
versions of libtorch on the official download website of PyTorch -
What's Pre CXX11 ABI and CXX11 ABI? You can ask ChatGPT, and here's its answer:
C++ Application Binary Interface (ABI) is the specification to which executable code adheres in order to facilitate correct interaction between different executable components. This includes conventions for name mangling, exception handling, calling conventions, and the layout of object code and system libraries. The term "Pre-CXX ABI" likely refers to a version of the C++ ABI that was in use before a specific change was introduced. An ABI can change over time as new language features are added, compilers improve, or for other reasons. When such changes occur, binary code compiled with a newer version of the compiler may not be compatible with code compiled with an older version, due to different expectations about how things like name mangling or exception handling work. One notable ABI break in C++ occurred with the release of GCC 5.1. This release changed the ABI in a way that was not backwards-compatible, primarily to improve the implementation of C++11's std::string and std::list types. The ABI used by versions of GCC prior to this change is often referred to as the "old" or "pre-CXX11" ABI. Code compiled with the new ABI cannot be safely linked with code compiled with the old ABI.
This basically means that
Pre CXX11 ABI
andCXX11 ABI
are two distinct versions of a library, and cannot be used in a mixture. Since Torch-TensorRT depends on PyTorch, whether to usePre CXX11 ABI
orCXX11 ABI
also depends on how PyTorch is built. To check this, you can directly consult torch in python3:python3 Python 3.6.9 (default, Mar 10 2023, 16:46:00) [GCC 8.4.0] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import torch >>> torch._C._GLIBCXX_USE_CXX11_ABI True
This shows that the installed PyTorch is compiled with
CXX11 ABI
, which means that the libtorch under the hood is also compiled withCXX11 ABI
-
-
Setup your development environment
sudo ./scripts/setup.sh
This will install
pre-commit
hooks and its dependencies locally, so that each time before you commit, the code will be formatted and linted automatically. Remember togit add .
aftergit commit
failed sincepre-commit
will modify source code in-place. -
Check the GitHub webpage after a few minutes to see if the CI passed. Passing CI will have a green check mark on the commit
- If not, please fix the errors and push again
- Add more supported Jetson hardware + L4T version, currently supported:
R32.7 R35.3 ... Orin AGX X X X NX X X X Xavier AGX X X X NX ✓ X X TX 1 x N/A N/A 2 x N/A N/A NANO x N/A N/A - Only VPI and TensorRT are hardware specific, i.e., they are provided by
deb https://repo.download.nvidia.com/jetson/t194 r32.7 main
, so you can still use the docker image for NX on AGX if you don't need to use VPI and TensorRT - R32.7.3 is the last L4T version that supports Ubuntu 18.04, later versions ( > R34.1 ) are all on Ubuntu 20.04
- Only VPI and TensorRT are hardware specific, i.e., they are provided by
- Follow the same naming convention and format for
dockerfile
andbuild.sh
, put the CPU-only build indockerfile
and put the GPU-required parts inbuild.sh
- There were attempts to put GPU-required parts in
dockerfile
, however, it didn't work, here's what's been tried:-
Modify
/etc/docker/daemon.json
to:{ "runtimes": { "nvidia": { "path": "nvidia-container-runtime", "runtimeArgs": [] } }, "default-runtime": "nvidia" }
Then
sudo systemctl restart docker
, this enables--runtime=nvidia
in dockerfile build stage, but the/usr/local/cuda-10.2/
inside the docker is still missing important libraries to build GPU-required libraries -
Save state of
/usr/local/cuda-10.2/
temporarily,COPY
recursively from a--build-context cuda-config=/usr/local/cuda-10.2/
, then build GPU-required libraries, finally restore/usr/local/cuda-10.2/
to the state beforeCOPY
- This didn't work since there were more required libraries under
/usr/lib/aarch64-linux-gnu/
likelibcudnn.so
, but the directory contains almost all the other unused libraries and is too big to perform the previous practice
- This didn't work since there were more required libraries under
-
- There were attempts to put GPU-required parts in
- Author: Mukai (Tom Notch) Yu: [email protected]