
Commit e2fbae0

add known issue of EmbeddingBag INT8 accuracy loss (#1017)

* add known issue: restate the embedding issue
* add release notes for 1.12.100

1 parent f092200 commit e2fbae0

File tree

2 files changed: +6 −0 lines changed


docs/tutorials/performance_tuning/known_issues.md (+2)

@@ -1,6 +1,8 @@
 Known Issues
 ============
 
+- Support of EmbeddingBag with INT8 when bag size > 1 is work in progress.
+
 - Compiling with gcc 11 might result in `illegal instruction` error.
 
 - `RuntimeError: Overflow when unpacking long` when a tensor's min/max value exceeds the int range while performing INT8 calibration. Please customize QConfig to use the min-max calibration method.
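For context on the new known issue: "bag size" is the number of indices pooled into each output row of `nn.EmbeddingBag`. A minimal sketch (the values here are illustrative, not from the commit) of a bag size greater than 1:

```python
import torch
import torch.nn as nn

# Each of the two bags below pools 4 indices (bag size > 1),
# which is the configuration the INT8 known issue refers to.
emb = nn.EmbeddingBag(num_embeddings=10, embedding_dim=4, mode="sum")
indices = torch.tensor([1, 2, 4, 5, 4, 3, 2, 9])
offsets = torch.tensor([0, 4])  # bags start at positions 0 and 4
out = emb(indices, offsets)
print(out.shape)  # torch.Size([2, 4])
```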

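The QConfig workaround mentioned in the diff could be sketched as follows; the observer choices here are assumptions for illustration, not the extension's prescribed configuration:

```python
import torch
from torch.ao.quantization import (
    QConfig,
    MinMaxObserver,
    PerChannelMinMaxObserver,
)

# Assumed example: use min-max observers instead of histogram-based
# calibration, which can hit "Overflow when unpacking long" during
# INT8 calibration when a tensor's min/max exceeds the int range.
qconfig = QConfig(
    activation=MinMaxObserver.with_args(dtype=torch.quint8,
                                        qscheme=torch.per_tensor_affine),
    weight=PerChannelMinMaxObserver.with_args(dtype=torch.qint8,
                                              qscheme=torch.per_channel_symmetric),
)
print(type(qconfig).__name__)  # QConfig
```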
docs/tutorials/releases.md (+4)

@@ -1,6 +1,10 @@
 Releases
 =============
 
+## 1.12.100
+
+This is a patch release to fix the AVX2 issue that blocks running on non-AVX512 platforms.
+
 ## 1.12.0
 
 We are excited to bring you the release of Intel® Extension for PyTorch\* 1.12.0-cpu, tightly following the PyTorch [1.12](https://github.com/pytorch/pytorch/releases/tag/v1.12.0) release. In this release, we matured automatic INT8 quantization and made it a stable feature. We stabilized the runtime extension and introduced a MultiStreamModule feature to further boost throughput in offline inference scenarios. We also delivered various operator- and graph-level enhancements that benefit the performance of a broad set of workloads.
