[Enhancement] Pre-process Audio #66
Comments
The docs say it needs CUDA 11.8+. This will need to be upgraded:
@joecryptotoo Do you want to try working on this?
@dnhkng I don't think I have the skills for this one, but I'm happy to help research it.
That repo looks surprisingly old for audio AI!
This microservice might do the job: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/maxine/helm-charts/ucf-audio-multistream
I tried to download the SDK so I could run it on my Linux server with an RTX 4090, but it looks like on Linux they only support these NVIDIA products: A40/A30/A2/V100/A10/T4/A16/A100, while any RTX GPU can be used on Windows.
I found a discussion about this on Reddit: https://www.reddit.com/r/linux/comments/vs9pdd/for_those_who_also_want_nvidia_rtx_voice_on_linux/ Someone suggested this project as an alternative:
This is a really cool feature in my opinion. Any luck so far?
Would a denoising or echo cancellation approach really solve this issue?
The best option is to use a conference room microphone/speaker combo. They do hardware noise cancellation, and don't waste GPU VRAM on a signal processing model.
There is an interesting set of Nvidia models for audio processing:
https://docs.nvidia.com/deeplearning/maxine/audio-effects-sdk/index.html
Of particular interest are:
The first could clean up input audio before it gets passed to ASR. The second takes two streams, the recorded audio and Glados's voice output, and could help remove her voice from what she's listening to. This would improve the 'interruption' feature significantly!
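For anyone who wants to experiment with the two-stream idea without the NVIDIA SDK, here is a minimal sketch using a classic NLMS adaptive filter. To be clear, this is not the Maxine model and says nothing about how the SDK works internally; it is a plain signal-processing stand-in. The function name `nlms_echo_cancel`, the tap count, the step size, and the simulated echo path in the demo are all assumptions made for illustration, and real room echo would need a much longer filter plus proper time alignment of the two streams.

```python
import math
import random


def nlms_echo_cancel(mic, ref, taps=16, mu=0.5, eps=1e-8):
    """Subtract an adaptive estimate of the played-back voice from the mic signal.

    mic: recorded samples (user speech plus echo of the assistant's voice)
    ref: samples the assistant played back, assumed time-aligned with `mic`
    Returns the residual, which should mostly contain the user's speech.
    """
    weights = [0.0] * taps   # running estimate of the echo path (FIR filter)
    history = [0.0] * taps   # most recent reference samples, newest first
    out = []
    for m, r in zip(mic, ref):
        history = [r] + history[:-1]
        echo_estimate = sum(w * x for w, x in zip(weights, history))
        error = m - echo_estimate                      # mic minus estimated echo
        energy = sum(x * x for x in history) + eps     # normalisation term
        step = mu * error / energy
        weights = [w + step * x for w, x in zip(weights, history)]
        out.append(error)
    return out


def power(signal):
    """Mean squared amplitude of a list of samples."""
    return sum(s * s for s in signal) / len(signal)


if __name__ == "__main__":
    random.seed(0)
    n = 4000
    # Stand-in for the assistant's voice output (white noise for simplicity).
    ref = [random.uniform(-1.0, 1.0) for _ in range(n)]
    # Hypothetical echo path: a few delayed, attenuated copies of the output.
    path = [0.0, 0.6, 0.0, 0.3, -0.1]
    echo = [
        sum(path[k] * ref[i - k] for k in range(len(path)) if i - k >= 0)
        for i in range(n)
    ]
    # Stand-in for the user's speech: a quiet sine tone.
    user = [0.05 * math.sin(2 * math.pi * 0.01 * i) for i in range(n)]
    mic = [e + u for e, u in zip(echo, user)]

    cleaned = nlms_echo_cancel(mic, ref)
    residual_echo = [c - u for c, u in zip(cleaned, user)]
    print("echo power before:", power(echo[-1000:]))
    print("echo power after: ", power(residual_echo[-1000:]))
```

The appeal of this kind of filter is that it runs on the CPU and needs no VRAM, which speaks to the concern raised above about spending GPU memory on signal processing. Whether it is good enough for the interruption feature in a real room is an open question.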
Looking for coders to help out with this!