feat: implement multimodal live API #306

nakasyou · 2024-12-14T04:19:43Z

I implemented Multimodal Live API for gemini-2.0-flash-exp. Users using JavaScript also can use Multimodal Live API like Python users.

Example:

import { GoogleGenerativeAI } from '@google/generative-ai'

const model = new GoogleGenerativeAI().getGenerativeModel({ model: 'gemini-2.0-flash-exp' })

const session = await model.connectLive() // Connect to live server

session.send({
  text: 'Hello, what is your name?'
})

for await (const message of session.listen()) {
  // Handling `message` variable.
  // It includes speaking data which is audio/pcm, and base64 encoded.
}

Note: some JSDocs were copy-and-pasted from Multimodal Live API. Is it OK?

google-cla · 2024-12-14T04:19:47Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

src/models/generative-model.ts

Co-authored-by: EdamAmex <[email protected]>

…-ai-js into feat/stream-realtime

feat: implement multimodal stream API

18df0c3

EdamAme-x reviewed Dec 14, 2024

View reviewed changes

src/models/generative-model.ts Outdated Show resolved Hide resolved

nakasyou and others added 5 commits December 14, 2024 14:58

refactor: spread syntax can receive undefined

e37b359

Co-authored-by: EdamAmex <[email protected]>

refactor(types): use the name given in the docs and add JSDocs

29af97d

feat: rename LiveClient to LiveSession

bc5a651

Merge branch 'feat/stream-realtime' of github.com:nakasyou/generative…

8d05413

…-ai-js into feat/stream-realtime

fix: JSON parsing failed on browser

4da4332

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement multimodal live API #306

feat: implement multimodal live API #306

nakasyou commented Dec 14, 2024 •

edited

Loading

google-cla bot commented Dec 14, 2024

feat: implement multimodal live API #306

Are you sure you want to change the base?

feat: implement multimodal live API #306

Conversation

nakasyou commented Dec 14, 2024 • edited Loading

google-cla bot commented Dec 14, 2024

nakasyou commented Dec 14, 2024 •

edited

Loading