Back to AIBriefs
AnalysisAI ModelsDevelopers
Featured

Thor Schaeff demos Gemini's audio stack in new talk

Gemini 3 Flash Preview can produce speaker labels, timestamps, emotion tags, and language detection with translation in one API call. The talk covers speech generation and live music capabilities, demonstrating the underlying audio understanding layer.

·
1 day ago
Thor Schaeff demos Gemini's audio stack in new talk — AIBriefs