
Cannot use the NPU cards when running inference on Ascend 910B #6671

Closed
1 task done
winni0 opened this issue Jan 16, 2025 · 3 comments
Labels
npu This problem is related to NPU devices solved This problem has been already solved

Comments

winni0 commented Jan 16, 2025

Reminder

  • I have read the above rules and searched the existing issues.

System Info

  • llamafactory version: 0.9.2.dev0
  • Platform: Linux-5.15.0-127-generic-aarch64-with-glibc2.35
  • Python version: 3.10.15
  • PyTorch version: 2.3.1 (NPU)
  • Transformers version: 4.46.1
  • Datasets version: 3.1.0
  • Accelerate version: 1.0.1
  • PEFT version: 0.12.0
  • TRL version: 0.9.6
  • NPU type: Ascend910B3
  • CANN version: 8.0.RC3
  • DeepSpeed version: 0.14.4

Reproduction

User: 你是谁? (Who are you?)
Assistant: [W compiler_depend.ts:51] Warning: CAUTION: The operator 'aten::isin.Tensor_Tensor_out' is not currently supported on the NPU backend and will fall back to run on the CPU. This may have performance implications. (function npu_cpu_fallback)
/usr/local/python3.10/lib/python3.10/site-packages/transformers/generation/logits_process.py:1634: UserWarning: AutoNonVariableTypeMode is deprecated and will be removed in 1.10 release. For kernel implementations please use AutoDispatchBelowADInplaceOrView instead, If you are looking for a user facing API to enable running your inference-only workload, please use c10::InferenceMode. Using AutoDispatchBelowADInplaceOrView in user code is under risk of producing silent wrong result in some edge cases. See Note [AutoDispatchBelowAutograd] for more details. (Triggered internally at build/CMakeFiles/torch_npu.dir/compiler_depend.ts:74.)
  scores_processed = torch.where(scores != scores, 0.0, scores)

Others

The command executed was: ASCEND_RT_VISIBLE_DEVICES=0,1,2,3 llamafactory-cli chat \
    --model_name_or_path /LLaMA-Factory-main/model/Qwen2-VL-2B-Instruct \
    --template qwen2_vl \
    --infer_backend huggingface
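
A quick way to rule out a visibility problem (a minimal sketch, assuming the torch_npu adapter from the setup above) is to ask PyTorch how many NPUs it sees under the same environment:

```python
# Run with the same ASCEND_RT_VISIBLE_DEVICES=0,1,2,3 setting as the chat command.
import torch
import torch_npu  # Ascend adapter; registers the "npu" device type with PyTorch

print(torch.npu.is_available())   # True if the NPU runtime initializes
print(torch.npu.device_count())   # expected 4 when cards 0-3 are visible

# Creating a tensor on npu:0 confirms ops dispatch to the card rather than the CPU.
x = torch.ones(2, 2, device="npu:0")
print(x.device)
```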

@winni0 winni0 added bug Something isn't working pending This problem is yet to be addressed labels Jan 16, 2025
@github-actions github-actions bot added the npu This problem is related to NPU devices label Jan 16, 2025
1737686924 commented

The command executed was: ASCEND_LAUNCH_BLOCKING=0,1,2,3 llamafactory-cli chat --model_name_or_path /LLaMA-Factory-main/model/Qwen2-VL-2B-Instruct --template qwen2_vl --infer_backend huggingface

ASCEND_RT_VISIBLE_DEVICES is the variable that specifies the NPU cards.
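
For context, the two variables do different jobs: ASCEND_RT_VISIBLE_DEVICES takes a comma-separated card list, while ASCEND_LAUNCH_BLOCKING is a 0/1 debugging switch that forces synchronous kernel launches. A minimal sketch of the distinction (assuming torch_npu, with the variables set before the NPU runtime initializes):

```python
# Sketch: the roles of the two environment variables (set before importing torch_npu).
import os
os.environ["ASCEND_RT_VISIBLE_DEVICES"] = "0,1,2,3"  # which physical cards are visible
os.environ["ASCEND_LAUNCH_BLOCKING"] = "1"           # debug switch (0/1), not a device list

import torch
import torch_npu

print(torch.npu.device_count())  # 4: logical npu:0..npu:3 map to the listed cards
```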


winni0 (Author) commented Jan 16, 2025

I changed it, but I still cannot use the cards.

codemayq (Collaborator) commented

If you see this Warning: CAUTION: The operator 'aten::isin.Tensor_Tensor_out' is not currently supported on the NPU backend and will fall back to run on the CPU. This may have performance implications. (function npu_cpu_fallback), it means the NPU is already being invoked.
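
Put differently, the npu_cpu_fallback message covers a single operator, not the whole model: aten::isin had no NPU kernel in this torch_npu release, so that one op is computed on the CPU and its result handed back to the NPU. A small sketch of the same behavior (assuming torch_npu 2.3.1, as in the System Info above):

```python
import torch
import torch_npu

a = torch.tensor([1, 2, 3], device="npu:0")
b = torch.tensor([2, 3], device="npu:0")

# torch.isin lowers to aten::isin.Tensor_Tensor_out; without an NPU kernel it
# triggers the npu_cpu_fallback warning, yet the result is still an NPU tensor.
print(torch.isin(a, b))          # tensor([False, True, True], device='npu:0')
print(torch.isin(a, b).device)   # npu:0, so the rest of the graph stays on the card
```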

@hiyouga hiyouga added solved This problem has been already solved and removed bug Something isn't working pending This problem is yet to be addressed labels Jan 17, 2025
@hiyouga hiyouga closed this as completed Jan 17, 2025