Commit

Added Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 (#3032)
liamjxu authored Oct 15, 2024
1 parent e937c98 commit c09fe75
Showing 2 changed files with 34 additions and 0 deletions.
18 changes: 18 additions & 0 deletions src/helm/config/model_deployments.yaml
@@ -495,6 +495,24 @@ model_deployments:
      args:
        safety_settings_preset: block_none

  - name: google/gemini-1.5-pro-002
    model_name: google/gemini-1.5-pro-002
    tokenizer_name: google/gemma-2b # Gemini has no tokenizer endpoint, so we approximate by using Gemma's tokenizer.
    max_sequence_length: 1000000 # Source: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models#gemini-models
    # TODO: Max output tokens: 8192
    client_spec:
      class_name: "helm.clients.vertexai_client.VertexAIChatClient"

  - name: google/gemini-1.5-flash-002
    model_name: google/gemini-1.5-flash-002
    tokenizer_name: google/gemma-2b # Gemini has no tokenizer endpoint, so we approximate by using Gemma's tokenizer.
    max_sequence_length: 1000000 # Source: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models#gemini-models
    # TODO: Max output tokens: 8192
    client_spec:
      class_name: "helm.clients.vertexai_client.VertexAIChatClient"
      args:
        safety_settings_preset: default

  ## Gemma
  - name: together/gemma-2b
    model_name: google/gemma-2b
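
The two new entries differ in one respect that is easy to miss: `google/gemini-1.5-pro-002` has no `args` block, while `google/gemini-1.5-flash-002` sets `safety_settings_preset: default`. The metadata descriptions added later in this commit say both models have all safety thresholds set to `BLOCK_NONE`. The sketch below is a reviewer's aid, not HELM code: it hand-copies the two entries into plain Python dicts and reads the preset from each. The helper name `safety_preset` is hypothetical, and what `VertexAIChatClient` does when no preset is passed is not shown in this diff.

```python
from typing import Any, Dict, List, Optional

# Hand-copied from the two entries added in this diff (YAML comments omitted).
DEPLOYMENTS: List[Dict[str, Any]] = [
    {
        "name": "google/gemini-1.5-pro-002",
        "model_name": "google/gemini-1.5-pro-002",
        "tokenizer_name": "google/gemma-2b",
        "max_sequence_length": 1000000,
        "client_spec": {
            "class_name": "helm.clients.vertexai_client.VertexAIChatClient",
        },
    },
    {
        "name": "google/gemini-1.5-flash-002",
        "model_name": "google/gemini-1.5-flash-002",
        "tokenizer_name": "google/gemma-2b",
        "max_sequence_length": 1000000,
        "client_spec": {
            "class_name": "helm.clients.vertexai_client.VertexAIChatClient",
            "args": {"safety_settings_preset": "default"},
        },
    },
]


def safety_preset(deployment: Dict[str, Any]) -> Optional[str]:
    """Hypothetical helper: return client_spec.args.safety_settings_preset, or None if unset."""
    args = deployment.get("client_spec", {}).get("args") or {}
    return args.get("safety_settings_preset")


if __name__ == "__main__":
    for deployment in DEPLOYMENTS:
        print(deployment["name"], "->", safety_preset(deployment))
```

Neither entry reads back as `block_none`, so the description text and the deployment settings may be worth reconciling in a follow-up.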
16 changes: 16 additions & 0 deletions src/helm/config/model_metadata.yaml
@@ -933,6 +933,22 @@ models:
    release_date: 2024-05-24
    tags: [TEXT_MODEL_TAG, VISION_LANGUAGE_MODEL_TAG, GOOGLE_GEMINI_MODEL_TAG, LIMITED_FUNCTIONALITY_TEXT_MODEL_TAG, INSTRUCTION_FOLLOWING_MODEL_TAG]

  - name: google/gemini-1.5-pro-002
    display_name: Gemini 1.5 Pro (002)
    description: Gemini 1.5 Pro is a multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from long contexts. This model is accessed through Vertex AI and has all safety thresholds set to `BLOCK_NONE`. ([paper](https://arxiv.org/abs/2403.05530))
    creator_organization_name: Google
    access: limited
    release_date: 2024-09-24
    tags: [TEXT_MODEL_TAG, VISION_LANGUAGE_MODEL_TAG, GOOGLE_GEMINI_MODEL_TAG, LIMITED_FUNCTIONALITY_TEXT_MODEL_TAG, INSTRUCTION_FOLLOWING_MODEL_TAG]

  - name: google/gemini-1.5-flash-002
    display_name: Gemini 1.5 Flash (002)
    description: Gemini 1.5 Flash is a multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from long contexts. This model is accessed through Vertex AI and has all safety thresholds set to `BLOCK_NONE`. ([paper](https://arxiv.org/abs/2403.05530))
    creator_organization_name: Google
    access: limited
    release_date: 2024-09-24
    tags: [TEXT_MODEL_TAG, VISION_LANGUAGE_MODEL_TAG, GOOGLE_GEMINI_MODEL_TAG, LIMITED_FUNCTIONALITY_TEXT_MODEL_TAG, INSTRUCTION_FOLLOWING_MODEL_TAG]

  - name: google/gemma-2b
    display_name: Gemma (2B)
    description: Gemma is a family of lightweight, open models built from the research and technology that Google used to create the Gemini models. ([model card](https://www.kaggle.com/models/google/gemma), [blog post](https://blog.google/technology/developers/gemma-open-models/))
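
The commit touches two files that need to stay in step: each `model_name` in `model_deployments.yaml` is expected to match a `name` in `model_metadata.yaml`. The sketch below shows one way to check that pairing for the entries visible in this diff. The function `missing_metadata` and both name lists are illustrative (copied by hand from the hunks above), not part of HELM's API.

```python
from typing import Iterable, List

# model_name values from the deployments added in this commit.
DEPLOYMENT_MODEL_NAMES = [
    "google/gemini-1.5-pro-002",
    "google/gemini-1.5-flash-002",
]

# name values visible in the model_metadata.yaml hunk of this commit.
METADATA_NAMES = [
    "google/gemini-1.5-pro-002",
    "google/gemini-1.5-flash-002",
    "google/gemma-2b",
]


def missing_metadata(
    deployment_model_names: Iterable[str], metadata_names: Iterable[str]
) -> List[str]:
    """Hypothetical check: list deployment model names with no metadata entry."""
    known = set(metadata_names)
    return [name for name in deployment_model_names if name not in known]


if __name__ == "__main__":
    # An empty list means every new deployment has a metadata entry.
    print(missing_metadata(DEPLOYMENT_MODEL_NAMES, METADATA_NAMES))
```

For the two models added here the check comes back empty, which matches the "2 changed files" shape of the commit: one deployment entry and one metadata entry per model.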