Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Promote vision functions out of beta #926

Merged
merged 7 commits into from
May 14, 2024
Merged

Promote vision functions out of beta #926

merged 7 commits into from
May 14, 2024

Conversation

jlowin
Copy link
Member

@jlowin jlowin commented May 14, 2024

Vision is no longer in beta, as it is officially supported on both GPT-4-Turbo and GPT-4o. That means we can move Marvin's vision functions out of beta as well.

Happily, the final versions of the API are much more powerful than the ones we built the original vision functions for. Marvin's original, beta implementation of the various vision-enhanced required two calls to the LLM: one to caption the images with an eye toward capturing detail relevant for the processing task at hand, and a second to take the resulting text and process it (e.g. for classification, casting, etc.).

The final API can do that in a single pass, so I've modified all the classic Marvin functions (cast, classify, extract) to accept image inputs in addition to strings (or, most interestingly, a mix of images and strings!). These work well and are much faster (almost twice as fast!) than the two-pass beta versions.

@jlowin
Copy link
Member Author

jlowin commented May 14, 2024

Note: this is technically a breaking change since it replaces e.g. marvin.beta.classify with marvin.classify, but that's why those functions were in the beta namespace to begin with.

Copy link
Collaborator

@zzstoatzz zzstoatzz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jlowin jlowin merged commit aedfb95 into main May 14, 2024
15 checks passed
@jlowin jlowin deleted the images branch May 14, 2024 11:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants