Brilliant Labs Frame - Realtime Gemini Voice and Vision Demo

Click on the images below, or on this text to view the video.

Demonstrates the integration of a multimodal realtime assistant with Brilliant Labs Frame.

In addition to audio streaming, the UI shows images that are streamed to the model, along with metadata about the conversation (turn taking and interruptions.)

The system prompt is editable to allow for customization of the assitant.

Each of the available Gemini Multimodal Live API voices (Puck, Charon, Kore, Fenrir, Aoede) are available for selection (the conversation needs to be restarted for a voice change to take effect.)

Gemini API Setup

The realtime assistant is provided through the Google Gemini Multimodal Live API, and API keys (currently with a free usage tier) are available with registration: See here.

Add your API key in the text box at the top of the screen of the demo app and "Save".

Please note, API queries using the free usage tier can be used for training Google's models, but queries using paid keys should not. Refer to Google's documentation for details.

Running the Demo

Grab your Brilliant Labs Frame, then:

Android: download and run the APK from Releases
iOS: clone, build and deploy to your iPhone using Flutter and Xcode from your Mac.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
android		android
assets		assets
docs		docs
ios		ios
lib		lib
.gitignore		.gitignore
.metadata		.metadata
LICENSE.md		LICENSE.md
README.md		README.md
analysis_options.yaml		analysis_options.yaml
pubspec.lock		pubspec.lock
pubspec.yaml		pubspec.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brilliant Labs Frame - Realtime Gemini Voice and Vision Demo

Gemini API Setup

Running the Demo

About

Releases 1

Packages

Languages

License

brilliantlabsAR/frame_realtime_gemini_voicevision

Folders and files

Latest commit

History

Repository files navigation

Brilliant Labs Frame - Realtime Gemini Voice and Vision Demo

Gemini API Setup

Running the Demo

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages