Promote vision functions out of beta #926

jlowin · 2024-05-14T02:30:05Z

Vision is no longer in beta, as it is officially supported on both GPT-4-Turbo and GPT-4o. That means we can move Marvin's vision functions out of beta as well.

Happily, the final versions of the API are much more powerful than the ones we built the original vision functions for. Marvin's original, beta implementation of the various vision-enhanced required two calls to the LLM: one to caption the images with an eye toward capturing detail relevant for the processing task at hand, and a second to take the resulting text and process it (e.g. for classification, casting, etc.).

The final API can do that in a single pass, so I've modified all the classic Marvin functions (cast, classify, extract) to accept image inputs in addition to strings (or, most interestingly, a mix of images and strings!). These work well and are much faster (almost twice as fast!) than the two-pass beta versions.

jlowin · 2024-05-14T02:31:51Z

Note: this is technically a breaking change since it replaces e.g. marvin.beta.classify with marvin.classify, but that's why those functions were in the beta namespace to begin with.

zzstoatzz

✨

jlowin added 3 commits May 13, 2024 20:32

Update transcript to support images

ad68b95

Update functions + docs

8ed2790

Remove beta references

c7b19b6

jlowin added 4 commits May 13, 2024 22:34

Fix tests

55a7848

Update tests

cf6aafb

Add instructions

ba62b63

Update flaky tests

30a4680

zzstoatzz approved these changes May 14, 2024

View reviewed changes

jlowin merged commit aedfb95 into main May 14, 2024
15 checks passed

jlowin deleted the images branch May 14, 2024 11:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Promote vision functions out of beta #926

Promote vision functions out of beta #926

jlowin commented May 14, 2024 •

edited

Loading

jlowin commented May 14, 2024

zzstoatzz left a comment

Promote vision functions out of beta #926

Promote vision functions out of beta #926

Conversation

jlowin commented May 14, 2024 • edited Loading

jlowin commented May 14, 2024

zzstoatzz left a comment

Choose a reason for hiding this comment

jlowin commented May 14, 2024 •

edited

Loading