update doc for using one socket with latency hint instead of one numa node
wangleis committed Dec 30, 2024
1 parent 2ef42d4 commit ef1298d
Showing 1 changed file with 14 additions and 14 deletions.
@@ -63,19 +63,19 @@ the model precision and the ratio of P-cores and E-cores.

Then the default settings for low-level performance properties on Windows and Linux are as follows:

-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
-| Property                             | Windows                                                               | Linux                                                                 |
-+======================================+=======================================================================+=======================================================================+
-| ``ov::num_streams``                  | 1                                                                     | 1                                                                     |
-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
-| ``ov::inference_num_threads``        | is equal to the number of P-cores or P-cores+E-cores on one numa node | is equal to the number of P-cores or P-cores+E-cores on one numa node |
-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
-| ``ov::hint::scheduling_core_type``   | :ref:`Core Type Table of Latency Hint <core_type_latency>`            | :ref:`Core Type Table of Latency Hint <core_type_latency>`            |
-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
-| ``ov::hint::enable_hyper_threading`` | No                                                                    | No                                                                    |
-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
-| ``ov::hint::enable_cpu_pinning``     | No / Not Supported                                                    | Yes except using P-cores and E-cores together                         |
-+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+
+| Property                             | Windows                                                            | Linux                                                              |
++======================================+====================================================================+====================================================================+
+| ``ov::num_streams``                  | 1                                                                  | 1                                                                  |
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+
+| ``ov::inference_num_threads``        | is equal to the number of P-cores or P-cores+E-cores on one socket | is equal to the number of P-cores or P-cores+E-cores on one socket |
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+
+| ``ov::hint::scheduling_core_type``   | :ref:`Core Type Table of Latency Hint <core_type_latency>`         | :ref:`Core Type Table of Latency Hint <core_type_latency>`         |
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+
+| ``ov::hint::enable_hyper_threading`` | No                                                                 | No                                                                 |
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+
+| ``ov::hint::enable_cpu_pinning``     | No / Not Supported                                                 | Yes except using P-cores and E-cores together                      |
++--------------------------------------+--------------------------------------------------------------------+--------------------------------------------------------------------+

.. note::

@@ -96,7 +96,7 @@ Then the default settings for low-level performance properties on Windows and Li
Starting from 5th Gen Intel Xeon Processors, a new microarchitecture enabled the sub-NUMA clusters
feature. A sub-NUMA cluster (SNC) can create two or more localization domains (NUMA nodes)
within a socket by BIOS configuration.
-By default OpenVINO with latency hint uses single NUMA node for inference. Although such
+By default OpenVINO with latency hint uses single socket for inference. Although such
behavior achieves the best performance for most models, there might be corner
cases which require manual tuning of the ``ov::num_streams`` and ``ov::hint::enable_hyper_threading`` parameters.
Please find more detail about `Sub-NUMA Clustering <https://www.intel.com/content/www/us/en/developer/articles/technical/xeon-processor-scalable-family-technical-overview.html>`__
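
The manual tuning mentioned in the note can be sketched as follows. This is a minimal illustration, not part of the commit: the helper name ``latency_config`` is hypothetical, and the string keys ``PERFORMANCE_HINT``, ``NUM_STREAMS`` and ``ENABLE_HYPER_THREADING`` are assumed to be the string forms of ``ov::hint::performance_mode``, ``ov::num_streams`` and ``ov::hint::enable_hyper_threading``. The helper only builds the configuration dictionary, so it runs without OpenVINO installed.

```python
def latency_config(num_streams=None, enable_hyper_threading=None):
    """Build a CPU config that keeps the latency hint and overrides only what is passed.

    Hypothetical helper for illustration; properties left as None are omitted
    so that the plugin defaults from the table apply.
    """
    config = {"PERFORMANCE_HINT": "LATENCY"}
    if num_streams is not None:
        if num_streams < 1:
            raise ValueError("num_streams must be at least 1")
        # Assumed string form of ov::num_streams.
        config["NUM_STREAMS"] = str(num_streams)
    if enable_hyper_threading is not None:
        # Assumed string form of ov::hint::enable_hyper_threading.
        config["ENABLE_HYPER_THREADING"] = "YES" if enable_hyper_threading else "NO"
    return config


# Example: one stream per NUMA node on a socket split into two SNC domains,
# with hyper-threading enabled. The value 2 is an arbitrary illustration.
tuned = latency_config(num_streams=2, enable_hyper_threading=True)
```

With OpenVINO installed, such a dictionary would typically be passed as the third argument of ``openvino.Core().compile_model(model, "CPU", tuned)``. Whether any override helps depends on the model and the BIOS configuration, so it is worth measuring against the defaults first.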