
Use anthropic.claude-3-5-sonnet-20241022-v2:0 using AWS' InvokeModel API #319

Open · rpgeddam opened this issue Jan 8, 2025 · 2 comments

rpgeddam commented Jan 8, 2025

I want to run a simple text inference against the anthropic.claude-3-5-sonnet models. This is not a chat interaction, and I would prefer not to use the Messages API, which means I want to use Bedrock's InvokeModel API rather than the Converse API (and their streaming equivalents).

As far as I can tell, both ChatBedrockConverse and ChatBedrock now use AWS' Converse API under the hood, and BedrockLLM doesn't support Claude v3 models.

Is there a way to use the InvokeModel API via LangChain?
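
For reference, the raw call outside LangChain looks roughly like this (a sketch, assuming boto3's standard bedrock-runtime client; note that even through InvokeModel, Bedrock expects an Anthropic messages-shaped body for Claude 3 models, so the prompt still has to be wrapped in a message):

import json

import boto3

# Sketch: standard boto3 bedrock-runtime client.
client = boto3.client("bedrock-runtime", region_name="us-east-1")

# Claude 3 models on Bedrock require the Anthropic messages body format,
# even via InvokeModel.
body = json.dumps({
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 100,
    "messages": [
        {"role": "user", "content": "Translate to French: I love sunny walks."}
    ],
})

response = client.invoke_model(
    modelId="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    body=body,
)
result = json.loads(response["body"].read())
print(result["content"][0]["text"])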


As an aside, the message structure required by the Converse API is too opinionated and rigid, and it causes a lot of issues with LangGraph. The requirement that messages always alternate between user and assistant is a problem when LangGraph has several agent nodes adding messages in sequence without user input.

rpgeddam (Author) commented Jan 8, 2025

This comment seems to say the alternating user/AI requirement is an Anthropic limitation rather than a Bedrock one :(

# Merge system, human, ai message runs because Anthropic expects (at most) 1
# system message then alternating human/ai messages.

Is my only option here to reformat my inference inputs into a messages format?
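
If the blocker is just the alternating-roles requirement, one workaround is to collapse consecutive same-role messages before invoking the model. A minimal sketch, assuming a recent langchain_core that exports merge_message_runs:

from langchain_core.messages import AIMessage, HumanMessage, merge_message_runs

# Consecutive messages of the same type, e.g. from several LangGraph agent
# nodes writing in a row without user input.
messages = [
    HumanMessage("Summarize the report."),
    AIMessage("Section 1: revenue grew 10%."),
    AIMessage("Section 2: costs were flat."),
]

# merge_message_runs folds adjacent same-type messages into one, which
# satisfies Anthropic's alternating human/ai requirement.
merged = merge_message_runs(messages)
print(merged)  # [HumanMessage(...), one combined AIMessage]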

patilvishal0597 commented
ChatBedrock supports the InvokeModel API. By default, the beta_use_converse_api flag is False and the InvokeModel API is called internally. If a user passes True, ChatBedrock switches to using the Converse API.
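
To make the choice of backend explicit, a short sketch (beta_use_converse_api is the ChatBedrock parameter described above; the other arguments mirror the snippet below):

from langchain_aws import ChatBedrock

# Explicitly keep the InvokeModel backend (this is already the default).
llm_invoke = ChatBedrock(
    model_id="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    region_name="us-east-1",
    beta_use_converse_api=False,
)

# Opt in to the Converse API instead.
llm_converse = ChatBedrock(
    model_id="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    region_name="us-east-1",
    beta_use_converse_api=True,
)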

Following is a code snippet that uses ChatBedrock to invoke anthropic.claude-3-5-sonnet-20241022-v2:0:

from langchain_aws import ChatBedrock


def main():
    # InvokeModel is used by default (beta_use_converse_api=False).
    llm = ChatBedrock(
        model_id="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
        region_name="us-east-1",
        model_kwargs={
            "max_tokens": 100,
            "top_p": 0.9,
            "temperature": 0.1,
        },
    )

    messages = [
        (
            "system",
            "You are a helpful assistant that translates English to French. Translate the user sentence.",
        ),
        ("human", "I love going out for a walk when the weather is bright and sunny."),
    ]

    # Invoke the llm
    response = llm.invoke(messages)
    print(response.content)


if __name__ == "__main__":
    main()

Output:

J'aime me promener quand il fait beau et ensoleillé.

Note: The model specified might not be available in all regions. Check the Model Availability page of the AWS docs.

Notice that region_name is us-east-1 even though, per the link above, the model itself is not available in that region. To use the latest Sonnet 3.5 from us-east-1, pass the cross-region inference profile ID 'us.anthropic.claude-3-5-sonnet-20241022-v2:0' as the model_id; Bedrock will route the inference to a region where the model is hosted. Similar thread here
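
Since the original question also asked about streaming equivalents: ChatBedrock supports LangChain's standard streaming interface, which, with the Converse flag off, should map to Bedrock's InvokeModelWithResponseStream (a sketch reusing the llm object from the snippet above; behavior as of current langchain-aws):

# Stream the response chunk by chunk instead of waiting for the full reply.
for chunk in llm.stream("Translate to French: The weather is lovely today."):
    print(chunk.content, end="", flush=True)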
