TensorFlow x CUDA Server Preconfiguration

Modified: 2021-05


For x86 Ubuntu 20.04 servers, install the NVIDIA CUDA toolkit:

sudo sh

Test the NVIDIA driver with nvidia-smi:

Tue May 25 19:27:22 2021       
| NVIDIA-SMI 460.73.01    Driver Version: 460.73.01    CUDA Version: 11.2     |
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|   0  GeForce GTX 970     Off  | 00000000:01:00.0 Off |                  N/A |
|  0%   46C    P8    15W / 200W |     15MiB /  4041MiB |      0%      Default |
|                               |                      |                  N/A |
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|    0   N/A  N/A      1302      G   /usr/lib/xorg/Xorg                  8MiB |
|    0   N/A  N/A      1487      G   /usr/bin/gnome-shell                2MiB |
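
To check the driver from a script rather than by eye, the header line of the `nvidia-smi` output can be parsed. The following is a minimal Python sketch, not part of the original setup. The names `parse_smi_header` and `installed_versions` are hypothetical, and the regex assumes the header keeps the `Driver Version: ... CUDA Version: ...` layout shown above.

```python
import re
import subprocess

# Assumption: the header line looks like the one in the output above,
# e.g. "Driver Version: 460.73.01    CUDA Version: 11.2".
_HEADER = re.compile(
    r"Driver Version:\s*(?P<driver>[\d.]+)\s+CUDA Version:\s*(?P<cuda>[\d.]+)"
)


def parse_smi_header(text):
    """Return (driver_version, cuda_version), or None if no header is found."""
    match = _HEADER.search(text)
    if match is None:
        return None
    return match.group("driver"), match.group("cuda")


def installed_versions():
    """Run nvidia-smi on this machine and parse its header.

    Requires the NVIDIA driver to be installed; raises if the tool is
    missing or exits with an error.
    """
    result = subprocess.run(
        ["nvidia-smi"], capture_output=True, text=True, check=True
    )
    return parse_smi_header(result.stdout)
```

On the server, `print(installed_versions())` should print the driver and CUDA versions as a tuple, or `None` if the header does not match the assumed layout.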

Install nvidia-docker:

distribution=$(. /etc/os-release;echo $ID$VERSION_ID)    && curl -s -L | sudo apt-key add -    && curl -s -L$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt update
sudo apt install -y nvidia-docker2
sudo reboot

Test the nvidia-docker link:

sudo systemctl restart docker
sudo docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi
Tue May 25 19:27:22 2021       
| NVIDIA-SMI 460.73.01    Driver Version: 460.73.01    CUDA Version: 11.2     |
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|   0  GeForce GTX 970     Off  | 00000000:01:00.0 Off |                  N/A |
|  0%   46C    P8    15W / 200W |     15MiB /  4041MiB |      0%      Default |
|                               |                      |                  N/A |
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|    0   N/A  N/A      1302      G   /usr/lib/xorg/Xorg                  8MiB |
|    0   N/A  N/A      1487      G   /usr/bin/gnome-shell                2MiB |
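
The `--runtime=nvidia` flag only works if Docker has registered a runtime with that name. As a rough sketch, this can be checked from Python, assuming `docker info --format '{{json .Runtimes}}'` prints a JSON object keyed by runtime name. The function names `has_nvidia_runtime` and `docker_runtimes_json` are hypothetical and not part of nvidia-docker.

```python
import json
import subprocess


def has_nvidia_runtime(runtimes_json):
    """Return True if the JSON object of Docker runtimes has an 'nvidia' key."""
    runtimes = json.loads(runtimes_json)
    return "nvidia" in runtimes


def docker_runtimes_json():
    """Ask the local Docker daemon for its registered runtimes.

    May need to run as root (or as a user in the docker group).
    """
    result = subprocess.run(
        ["docker", "info", "--format", "{{json .Runtimes}}"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

On the server, `has_nvidia_runtime(docker_runtimes_json())` returning `False` would suggest that nvidia-docker2 is not installed or that Docker has not been restarted since the install.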

Test the TensorFlow CUDA link:

docker run -it --rm --runtime=nvidia tensorflow/tensorflow:latest-gpu python -c "import tensorflow as tf; print(tf.reduce_sum(tf.random.normal([1000, 1000])))"
2021-05-25 23:29:48.607229: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.806968: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.841178: I tensorflow/core/common_runtime/gpu/] Found device 0 with properties: 
pciBusID: 0000:01:00.0 name: GeForce GTX 970 computeCapability: 5.2
coreClock: 1.253GHz coreCount: 13 deviceMemorySize: 3.95GiB deviceMemoryBandwidth: 208.91GiB/s
2021-05-25 23:29:49.841215: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.843860: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.843898: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.844789: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.845007: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.845935: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.846697: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.846844: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:49.847969: I tensorflow/core/common_runtime/gpu/] Adding visible gpu devices: 0
2021-05-25 23:29:49.848312: I tensorflow/core/platform/] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-05-25 23:29:49.849489: I tensorflow/core/common_runtime/gpu/] Found device 0 with properties: 
pciBusID: 0000:01:00.0 name: GeForce GTX 970 computeCapability: 5.2
coreClock: 1.253GHz coreCount: 13 deviceMemorySize: 3.95GiB deviceMemoryBandwidth: 208.91GiB/s
2021-05-25 23:29:49.850424: I tensorflow/core/common_runtime/gpu/] Adding visible gpu devices: 0
2021-05-25 23:29:49.850460: I tensorflow/stream_executor/platform/default/] Successfully opened dynamic library
2021-05-25 23:29:50.306046: I tensorflow/core/common_runtime/gpu/] Device interconnect StreamExecutor with strength 1 edge matrix:
2021-05-25 23:29:50.306095: I tensorflow/core/common_runtime/gpu/]      0 
2021-05-25 23:29:50.306103: I tensorflow/core/common_runtime/gpu/] 0:   N 
2021-05-25 23:29:50.307221: I tensorflow/core/common_runtime/gpu/] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 3345 MB memory) -> physical GPU (device: 0, name: GeForce GTX 970, pci bus id: 0000:01:00.0, compute capability: 5.2)
tf.Tensor(381.3887, shape=(), dtype=float32)
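
The log above is verbose, and the final tensor value alone does not say which device computed it. A more direct check is to ask TensorFlow which GPUs it can see. The following is a minimal sketch using `tf.config.list_physical_devices`; the helper names `gpu_report` and `visible_gpu_names` are hypothetical, and TensorFlow is imported lazily so that the reporting helper can be used without it.

```python
def gpu_report(device_names):
    """Build a one-line summary from a list of GPU device names."""
    if not device_names:
        return "No GPU visible to TensorFlow"
    return "GPUs visible to TensorFlow: " + ", ".join(device_names)


def visible_gpu_names():
    """Return the names of the GPUs TensorFlow can see.

    Imports TensorFlow inside the function, so it must be called in an
    environment where TensorFlow is installed (e.g. the GPU container).
    """
    import tensorflow as tf

    return [device.name for device in tf.config.list_physical_devices("GPU")]
```

Inside the `tensorflow/tensorflow:latest-gpu` container, `print(gpu_report(visible_gpu_names()))` should list one device name per GPU when the CUDA link works.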