Some requests to gemini models are being stuck #4509

romantsovmike · 2024-10-04T07:20:50Z

Environment details

OS type and version: docker container based on python:3.11-slim
Python version: 3.11
pip version: 24.1.1
google-cloud-aiplatform version: 1.67.1

Code example

Hi, we are using gemini models (pro, flash) on VertexAI platform and some of the async requests are being stuck forever. The code of calling the model is the following:

import vertexai
from vertexai.preview.generative_models import GenerativeModel

vertexai.init(project=..., location=...)
vision_model = GenerativeModel(model_name)

...

result = await vision_model.generate_content_async(
    contents=content_to_model,
    safety_settings=safety_config,
    generation_config=generation_config,
    stream=False,
)

There is no possibility to set the request timeout for this call, so we created our own one with the following code:

result = await asyncio.wait_for(
    vision_model.generate_content_async(
        contents=content_to_model,
        safety_settings=safety_config,
        generation_config=generation_config,
        stream=False,
    ),
    timeout=65
)

Some of the long-running calls are being caught by this timeout and we were able to retry the method, but some of them are still stuck forever for some reason.

Looks like there is some kind of thread locking inside of the async method from library. Something like the following code:

async def sleep_sync(timeout):
    time.sleep(timeout)
    return timeout

async def sleep_async(timeout):
    await asyncio.sleep(timeout)
    return timeout

# No locking, when timeout is reached, we receive exception
await asyncio.wait_for(
    sleep_async(10),
    timeout=4
)

# This code is being locked by synchronous time.sleep method
await asyncio.wait_for(
    sleep_sync(10),
    timeout=4
)

Stack trace

No stack trace available because the code stuck

The text was updated successfully, but these errors were encountered:

romantsovmike · 2024-10-07T08:01:46Z

By the way, it would be great if users will be able to pass timeout argument to the generate_content_async method.

product-auto-label bot added the api: vertex-ai Issues related to the googleapis/python-aiplatform API. label Oct 4, 2024

jaycee-li assigned Ark-kun Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some requests to gemini models are being stuck #4509

Some requests to gemini models are being stuck #4509

romantsovmike commented Oct 4, 2024

romantsovmike commented Oct 7, 2024

Some requests to gemini models are being stuck #4509

Some requests to gemini models are being stuck #4509

Comments

romantsovmike commented Oct 4, 2024

Environment details

Code example

Stack trace

romantsovmike commented Oct 7, 2024