-
I currently use RTX4090 to generate subtitles, but I still don’t understand much about compute_type selection. I also read the introduction here [https://opennmt.net/CTranslate2/quantization.html]. Regarding the computing power of RRTX4090, is the matching compute_type type float32 or other settings? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 5 replies
-
Use fastest for you. |
Beta Was this translation helpful? Give feedback.
-
Number of visible GPU devices: 1 Supported compute types by GPU: {'int8_float16', 'int8_float32', 'bfloat16', 'int8_bfloat16', 'int8', 'float32', 'float16'} [2023-09-30 09:19:56.036] [ctranslate2] [thread 13576] [info] CPU: GenuineIntel (SSE4.1=true, AVX=true, AVX2=true, AVX512=false) Model loaded in: 9.45 seconds According to the test, int8_float16 should be selected? |
Beta Was this translation helpful? Give feedback.
Most accurate theoretically can be most inaccurate in practice. :)
Use fastest, that's it.
On different samples different types can be more accurate.