Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any plan about support video and audio? #35

Open
thesby opened this issue Nov 5, 2024 · 1 comment
Open

Any plan about support video and audio? #35

thesby opened this issue Nov 5, 2024 · 1 comment

Comments

@thesby
Copy link

thesby commented Nov 5, 2024

Ovis is really good. Could you please support video and audio?

@runninglsy
Copy link
Collaborator

Thank you for your positive feedback on Ovis.

It's common practice to extract multiple frames from a video to create a multi-image input. While Ovis1.6 is primarily trained on single-image samples, it also supports multi-image inputs. An example is available at: #25

On the other hand, we are currently working on incorporating video data into our training process and plan to enhance video processing capabilities in future versions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants