You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi I was reading the vertex ai documentation - [https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/audio-understanding#:~:text=files%2C%20enable%20the-,audioTimestamp,-parameter%20in%20GenerationConfig]
Here it is mentioned:
2. Audio-only timestamps: To accurately generate timestamps for audio-only files, you must configure the audio_timestamp parameter in generation_config.
But I am not able to set this parameter in generation_config.
The text was updated successfully, but these errors were encountered:
Hi @Waheguru-Anurag, this field was recently added (last week) and is currently only available in the REST API. The Python SDK hasn't been updated to support it yet.
I tried this parameter using the REST API but didn't notice an improvement.
For a 3 min long file timestamps suggested it was over 4 minutes long.
I used this prompt:
Translate the audio to english. Include timestamps and speakers. Use the following format:
<example>
[00:17] Agent (male): Yes, sir. So, you have a shop that sells medicines, fertilizers, and seeds?
[00:19] Customer (male): Hmm.
[00:21] Agent (male): Sir, I have this app, sir, for retailers.
</example>
Hi I was reading the vertex ai documentation - [https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/audio-understanding#:~:text=files%2C%20enable%20the-,audioTimestamp,-parameter%20in%20GenerationConfig]
Here it is mentioned:
2. Audio-only timestamps: To accurately generate timestamps for audio-only files, you must configure the audio_timestamp parameter in generation_config.
But I am not able to set this parameter in generation_config.
The text was updated successfully, but these errors were encountered: