Not working with Gemma2 9B IT #12

Open
pratik443 opened this issue Jan 16, 2025 · 1 comment

Comments

pratik443 commented Jan 16, 2025

Hello, thank you for the amazing work and code.
I'm trying to adapt the code to the Gemma 2 9B IT model. After changing the prompts to the required chat template and running the code, it gives the following error:

attn_weights = attn_weights + causal_mask
RuntimeError: The size of tensor a (7538) must match the size of tensor b (46) at non-singleton dimension 3
It seems like past_key_values and the current inputs are creating this problem. With usePrompt set to True the generation works, but just using the cache with questions from the HotpotQA dataset (usePrompt set to False) doesn't work.

Does this imply that I need to make changes in the generate function, or is there some other issue?

SpeedReach (Collaborator) commented

Hi @pratik443,

Could you provide the full script so we can reproduce the error?
My first guess is that the KV cache in Gemma may have a different dimension representation, but this would need experimentation to confirm.
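Not from the thread, but a minimal sketch of how one might check that hypothesis: run Gemma 2 with use_cache=True and print the per-layer key-cache shapes, then compare them with the shapes the project's generate code assumes. The model id and the shape-printing logic here are illustrative assumptions, not the project's actual setup.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2-9b-it"  # assumed model id for this check
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model(**inputs, use_cache=True)

cache = out.past_key_values
# Newer transformers versions return a Cache object exposing a key_cache list;
# older versions return a tuple of (key, value) pairs per layer.
keys = cache.key_cache if hasattr(cache, "key_cache") else [k for k, _ in cache]
for i, k in enumerate(keys):
    # The usual layout is (batch, num_kv_heads, seq_len, head_dim); a seq_len
    # here that differs from what the generate code expects would explain the
    # causal_mask size mismatch reported above.
    print(f"layer {i}: key cache shape {tuple(k.shape)}")
```

If I recall correctly, Gemma 2 alternates sliding-window and global attention layers, so some layers' caches may not grow the way a standard decoder's cache does; that would be one way the dimensions could end up disagreeing with the mask.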
