Inference R1 with TensorRT-LLM #277

ColdsteelRail · 2025-02-05T10:11:29Z

Is it possible to deploy DeepSeek-R1 with TensorRT-LLM?

incomingflyingbrick · 2025-02-10T08:07:45Z

Same question here!

ColdsteelRail · 2025-02-10T12:42:08Z

Has anyone successfully deployed R1 on TensorRT-LLM?

I found that the README suggests deploying R1 the same way as V3. Since V3 can be deployed on TensorRT-LLM, R1 should be deployable on TensorRT-LLM as well.
