Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deepgram Self-hosted #1501

Open
beastoin opened this issue Dec 9, 2024 · 1 comment
Open

Deepgram Self-hosted #1501

beastoin opened this issue Dec 9, 2024 · 1 comment
Assignees

Comments

@beastoin
Copy link
Collaborator

beastoin commented Dec 9, 2024

currently we have the limits 100 concurrencies for DG + 100 concurrencies soniox
if we hit 10K user then we will need ~ 300 concurrencies. then if 100K ? yes 3000 concurrencies

--from Damien(DG team)
Then additional 100 concurrencies are $10k on each tier
If you self host Deepgram on your own infra there is no concurrency cost since you can scale as large as you like with your own GPUs
We have docker images and helm charts if you want to deploy on Kubernetes etc
https://developers.deepgram.com/docs/self-hosted-introduction
We will need an MNDA in place to provide access and share benchmarks for different GPUs

@beastoin beastoin converted this from a draft issue Dec 9, 2024
@beastoin beastoin moved this to To do in omi TODO Dec 23, 2024
@beastoin
Copy link
Collaborator Author

hi a @thainguyensunya it's time 🌚

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: To do
Development

No branches or pull requests

2 participants