You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Adding support to generate audio output would be great.
More details
With gemini-2.0-flash-exp Gemini can now support the generation of audio output. The generated audio response can be downloaded afterwards as well.
To access this feature on GCP you need to navigate to ‘Vertex AI’ → ‘Vertex AI Studio’ → ‘Freeform’ → Select ‘gemini-2.0-flash-exp’ as model and ‘Audio’ as response output type. The attached recording shows the process (sadly without the audio). The text was read out when clicking on the play button. Running the same prompt multiple times results in different audio files (different voices, speed, pronunciation, etc.)
The UI needs to be adjusted for models that allow this kind of output. Probably some endpoints need to be adjusted as well to deal with the new kind of data.
Note: Currently the feature is still experimental.
Which components are impacted by your request?
UI, General, Endpoints
Pictures
Code of Conduct
I agree to follow this project's Code of Conduct
The text was updated successfully, but these errors were encountered:
What features would you like to see added?
Adding support to generate audio output would be great.
More details
With gemini-2.0-flash-exp Gemini can now support the generation of audio output. The generated audio response can be downloaded afterwards as well.
To access this feature on GCP you need to navigate to ‘Vertex AI’ → ‘Vertex AI Studio’ → ‘Freeform’ → Select ‘gemini-2.0-flash-exp’ as model and ‘Audio’ as response output type. The attached recording shows the process (sadly without the audio). The text was read out when clicking on the play button. Running the same prompt multiple times results in different audio files (different voices, speed, pronunciation, etc.)
The UI needs to be adjusted for models that allow this kind of output. Probably some endpoints need to be adjusted as well to deal with the new kind of data.
Note: Currently the feature is still experimental.
Which components are impacted by your request?
UI, General, Endpoints
Pictures
Code of Conduct
The text was updated successfully, but these errors were encountered: