Voice Chat
Real-time audio-to-audio conversation. Native speech understanding with 30 selectable voices.
Configuration
Audio Controls
Transcription
Click "Microphone Off" to grant mic access and start talking.
Text Chat
You type; the model replies with native audio. Choose transcript-only (speaker off), audio-only, or both. Matches Live API requirements for Gemini 3.1 Flash Live.
Configuration
Chat
Vision
Send camera or screen video alongside audio/text. The model sees JPEG frames at up to 1 FPS.
Video Source
Preview
Response
Click Camera or Screen Share to grant access and start.
Function Calling
The model can call functions you define. Supports synchronous tool execution with the Live API.
Registered Functions
Get current weather for a city. Params: city (string)
Evaluate a math expression. Params: expression (string)
Get the current time. Params: timezone (string, optional)
Roll dice. Params: sides (int), count (int)
Conversation
Function Call Log
Function calls and responses will appear here.
Google Search Grounding
Ground responses in real-time Google Search results for up-to-date, factual answers.
Search-Grounded Chat
Thinking
Configure thinking depth: minimal (fastest) to high (most thorough). View the model's reasoning process.
Configuration
Conversation
Settings
Global configuration for session behavior.
Connection
Voice Activity Detection
Used for Voice Chat and Vision (realtime audio). Not sent for text-only features.
Session Management
Allows reconnecting to a session within 2 hours if disconnected.
Sliding window compression for unlimited session length.
Audio
Audio formats are fixed by the API. The browser handles conversion automatically.
Model Capabilities
Protocol Log
Raw WebSocket messages exchanged with the Gemini Live API.
WebSocket messages will appear here when a session is active.