Skip to content

This repository contains the benchmarking code for TF.

Notifications You must be signed in to change notification settings

toelt-llc/TOELT-tfrabbit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

31.08 Arnaud's merge

The informations about the plots are given in the code README here . The rest of the folders have a short readme to explain the organisation.

tfrabbit

Version 1.0

Author: Umberto Michelucci

This repository contains the benchmarking code for TensorFlow we developed. We wanted something easy to use and easy to interpret. There are many benchmarking suites but we wanted something quick to get an idea about how different configurations compare. The code will change with time so check the version of the code and the version of the script with which the results have been obtained.

Why the name?

You know the story about the rabbit and the turtle? About the race to see who is faster? Well if you know it you can imagine why the name right? This repository is all about benchmarking different systems, especially when regarding GPUs.

Benchmarking

A simple benchmarking script is resnet_benchmark1.py. To use it you need to:

  • git clone https://github.com/toelt-llc/tfrabbit.git
  • cd tfrabbit
  • cd code
  • python resnet_benchmark1.py

This will gives the time needed to train 1 epoch with two networks: VGG19 and resnet50 from scratch with CIFAR-10 Data. Please note that training on a CPU may take 20-30 minutes for each network so be warned. The dataset used is CIFAR-10. The training is performed with ImageGenerator with a batch_size=100. So for 100 effective images.

Sample Results

We are testing the code on many systems, as so far we have just a few numbers that are reported for information below. For specific cases we have below the complete output of the script (just as reference). The results have been obtained with version 1.0 of the script python resnet_benchmark1.py.

Summary

CPU GPU CUDA Version NVIDIA Driver (min) VGG19 Time (min) resnet50 Time (min)
 Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz - 18 Cores NVIDIA RTX A6000 GPU 48 Gb Memory 11.3 465.19.01 0.38 0.41
 Intel(R) Xeon(R) CPU @ 2.20GHz Tesla T4, 15 Gb Memory 11.2 460.32.03 1.05 0.70
 CPU Intel(R) i9 - 2.3 GHz 8-Core None N/A N/A 16.8 17.2
 CPU Intel(R) Core(TM) i7-7700 @ 3.60 Ghz - 4 Cores None N/A N/A 22.2 20.0

As a plot the numbers looks like the figure below

benchmark Figure

Dedicated Deep Learning Linux Server

  • NVIDIA RTX A6000 GPU 48 Gb Memory
  • Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz - 18 Cores

Software stack

  • TensorFlow 2.5
  • CUDA 11.3
  • NVIDIA Driver 465.19.01

Benchmark Output

-------------------------------------------------
Benachmark Results for VGG19

Elapsed Time (min): 0.3828892230987549
-------------------------------------------------
500/500 [==============================] - 25s 43ms/step - loss: 3.0390 - accuracy: 0.1462
-------------------------------------------------
Benachmark Results for resnet50

Elapsed Time (min): 0.4155214587847392
-------------------------------------------------

Macbook Pro 16 in (2020)

  • CPU Intel(R) i9 - 2.3 GHz 8-Core

Software Stack

  • TensorFlow 2.5

Benchmark Output

    -------------------------------------------------
    Benachmark Results for VGG19

    Elapsed Time (min): 16.789986399809518
    -------------------------------------------------
    500/500 [==============================] - 1029s 2s/step - loss: 2.8604 - accuracy: 0.1865
    -------------------------------------------------
    Benachmark Results for resnet50

    Elapsed Time (min): 17.157415350278217
    -------------------------------------------------

Google Colab

  • Intel(R) Xeon(R) CPU @ 2.20GHz
  • Tesla T4, 15 Gb Memory

Software stack

  • TensorFlow 2.5

  • CUDA 11.2

  • NVIDIA Driver 460.32.03

      -------------------------------------------------
      Benachmark Results for VGG19
    
      Elapsed Time (min): 1.0513679345448812
      -------------------------------------------------
      500/500 [==============================] - 42s 72ms/step - loss: 2.9594 - accuracy: 0.1757
      -------------------------------------------------
      Benachmark Results for resnet50
    
      Elapsed Time (min): 0.7041642387708028
      -------------------------------------------------
    

About

This repository contains the benchmarking code for TF.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published