Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

基于realtime-quickstart-react的DEMO在COZE平台上做了一个智能体,如何实现视频识别和工具调用 #110

Open
l1985q opened this issue Feb 23, 2025 · 1 comment

Comments

@l1985q
Copy link

l1985q commented Feb 23, 2025

实在找不到真人问这个问题了。我使用realtime-quickstart-react配套的视频做了一个智能体,选择视觉理解的基础模型,能够实现对话和视频信号的理解,但是加上其它工作流调用就是不成功。如果把基础模型换成豆包工具调用就能正常调用工作流,但视频理解就不能用了,如何实现两者都能用呢?

@jackshen3102
Copy link
Contributor

jackshen3102 commented Feb 24, 2025

实在找不到真人问这个问题了。我使用realtime-quickstart-react配套的视频做了一个智能体,选择视觉理解的基础模型,能够实现对话和视频信号的理解,但是加上其它工作流调用就是不成功。如果把基础模型换成豆包工具调用就能正常调用工作流,但视频理解就不能用了,如何实现两者都能用呢?

可以参考https://bytedance.larkoffice.com/docx/EylRdKjIMojqJPxlJTUc8EBDnEh 搭建自己的智能体试试

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants