Replies: 5 comments 13 replies
-
no need of dirization,etc .. just raw transcription is sufficient |
Beta Was this translation helpful? Give feedback.
-
not much silence .. customer service conversations |
Beta Was this translation helpful? Give feedback.
-
also any suggestions on parameters above to speed the process up would be greatly appreciated |
Beta Was this translation helpful? Give feedback.
-
@shaunck96 Have you solved it? You can anchieve to transcribe all the audios in about 2.5 hours max. Plus it is always a good practice when you use sensitive data |
Beta Was this translation helpful? Give feedback.
-
@shaunck96 You can try Batched Faster-Whipser https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file#batched-faster-whisper |
Beta Was this translation helpful? Give feedback.
-
Has anyone deployed this in production? medium version to transcribe batches of 10,000 calls/day with average duration of 6 minutes per call. Is it feasible from a cost standpoint versus a paid API like Azure STT. Accuracy is not the priority versus cost (as medium size model/distil medium can also be to used to perform transcription) and data privacy as we are dealing with very sensitive data. Leveraging distributed computing with spark capabilities and multi threading, currently with VMs in Azure cloud but limited to CPU for now (E4d_v4, E16d_v4, E32d_v4).
Beta Was this translation helpful? Give feedback.
All reactions