
bug: Llama reproduce error with kernl #321

Open
2 tasks done
yychen016 opened this issue Apr 23, 2023 · 1 comment

Comments

@yychen016

Description

When trying to use kernl with the default Llama 7B model on an A100 device, I get the error below.

Steps to reproduce

import torch
from transformers import LlamaConfig, LlamaForCausalLM
from kernl.model_optimization import optimize_model

config = LlamaConfig()  # default (7B-sized) config
model = LlamaForCausalLM(config).cuda()
optimize_model(model)

length = 5
input_ids = torch.randint(low=0, high=model.config.vocab_size, size=(1, length)).cuda()
with torch.inference_mode(), torch.cuda.amp.autocast():
    outputs = model.generate(input_ids=input_ids)

print(outputs.shape)
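To help isolate whether the failure comes from kernl or from the model itself, here is a baseline run of the same generate call without optimize_model (a minimal sketch, assuming only transformers and torch are installed; the tiny config values are illustrative so it runs quickly on CPU, not the 7B defaults used in the report):

```python
import torch
from transformers import LlamaConfig, LlamaForCausalLM

# Tiny illustrative config so the baseline runs on CPU in seconds;
# the actual report uses the default (7B-sized) LlamaConfig on an A100.
config = LlamaConfig(
    hidden_size=64,
    intermediate_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    vocab_size=1000,
)
model = LlamaForCausalLM(config)  # no optimize_model() call

length = 5
input_ids = torch.randint(low=0, high=config.vocab_size, size=(1, length))
with torch.inference_mode():
    outputs = model.generate(input_ids=input_ids, max_new_tokens=3)

print(outputs.shape)  # (1, length + up to 3 new tokens)
```

If this baseline succeeds while the kernl-optimized version fails, the problem is in the optimization path rather than in the Llama model code.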

Expected Behavior

A properly working Llama model.

Actual Behavior

The following error occurs (traceback attached as screenshots in the original issue; not transcribed here):

Your environment

  • A100

Self-service

  • I would be willing to help fix this bug myself.

Code of Conduct

  • I agree to follow this project's Code of Conduct
@SinanAkkoyun

It is the same for me; please help us use LLaMA models :)
