Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update using_guidance.md
#2901 opened Jan 10, 2025 by nbroad1881 Loading…
CI for: fix crash in torch2.6 if TP=1
#2898 opened Jan 10, 2025 by danieldk Loading…
5 tasks
CI for: chore: Update jsonschema to 0.28.0
#2893 opened Jan 9, 2025 by danieldk Loading…
5 tasks
Give TensorRT-LLMa proper CI/CD 😍
#2886 opened Jan 7, 2025 by mfuntowicz Loading…
fix crash in torch2.6 if TP=1
#2885 opened Jan 7, 2025 by sywangyi Loading…
feat: improve star coder to support multi lora layers
#2883 opened Jan 7, 2025 by drbh Loading…
docs(conceptual/speculation): available links Train Medusa
#2863 opened Dec 23, 2024 by guspan-tanadi Loading…
1 of 5 tasks
Fix docker run in README.md
#2861 opened Dec 20, 2024 by alvarobartt Loading…
2 of 5 tasks
Add fp8 kv cache for ROCm
#2856 opened Dec 18, 2024 by mht-sharma Loading…
5 tasks
Add Flash decoding kernel ROCm
#2855 opened Dec 18, 2024 by mht-sharma Loading…
5 tasks
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh Loading…
2 of 5 tasks
[TRTLLM] Expose finish reason
#2841 opened Dec 13, 2024 by mfuntowicz Loading…
feat: improve qwen2-vl startup
#2802 opened Dec 5, 2024 by drbh Loading…
Update tensor_parallel.py
#2798 opened Dec 3, 2024 by Lacacy Loading…
Install text-generation-server from poetry.lock export
#2786 opened Nov 29, 2024 by alvarobartt Loading…
1 of 5 tasks
Enable qwen2vl video
#2756 opened Nov 18, 2024 by drbh Loading…
9 tasks done
Add llama.cpp backend
#2723 opened Nov 4, 2024 by mfuntowicz Loading…
[WIP] Add gfx1100 support to AMD pytorch build
#2642 opened Oct 13, 2024 by cazlo Draft
1 of 5 tasks
ProTip! Follow long discussions with comments:>50.