Qwen2-14B inference garbled #601

kazyun · 2024-09-20T03:43:41Z

System Info

When using Qwen2, executing inference with the engine through the run.py script outputs normally. However, when using Triton for inference, some characters appear garbled, and the output is incomplete compared to the results obtained from using the script. What could be the cause of this issue?

maybe the config.pbtxt cause the problem

Who can help?

No response

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

start triton server

Expected behavior

get the same results with run.py script

actual behavior

When using Qwen2, executing inference with the engine through the run.py script outputs normally. However, when using Triton for inference, some characters appear garbled, and the output is incomplete compared to the results obtained from using the script. What could be the cause of this issue?

additional notes

no

The text was updated successfully, but these errors were encountered:

kazyun · 2024-09-20T08:28:43Z

This issue only occurs when using a streaming request.
payload = {
"text_input": QWEN_PROMPT_TEMPLATE.format(input_text=prompt),
"max_tokens": max_tokens,
"stream": True,
}

response = requests.post(server_url, json=payload, stream=True)

will-jay · 2024-10-28T14:00:30Z

This issue only occurs when using a streaming request. payload = { "text_input": QWEN_PROMPT_TEMPLATE.format(input_text=prompt), "max_tokens": max_tokens, "stream": True, }
response = requests.post(server_url, json=payload, stream=True)

Hi, I have the same problem. Is there any solution?

kazyun added the bug Something isn't working label Sep 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen2-14B inference garbled #601

Qwen2-14B inference garbled #601

kazyun commented Sep 20, 2024

kazyun commented Sep 20, 2024

will-jay commented Oct 28, 2024

Qwen2-14B inference garbled #601

Qwen2-14B inference garbled #601

Comments

kazyun commented Sep 20, 2024

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

actual behavior

additional notes

kazyun commented Sep 20, 2024

will-jay commented Oct 28, 2024