-
Notifications
You must be signed in to change notification settings - Fork 27.7k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix Mask2Former Weight Initialization Issues #35877
#35904
opened Jan 27, 2025 by
sambhavnoobcoder
Loading…
Introduce modular files for speech models
#35902
opened Jan 27, 2025 by
nikosanto13
Loading…
1 of 5 tasks
Several fixes related to rotary position embeddings
#35901
opened Jan 27, 2025 by
mseeger
Loading…
4 of 5 tasks
Fix Gradient Checkpointing for Deberta & Deberta-V2 using PEFT / Adapters
#35898
opened Jan 26, 2025 by
lenglaender
Loading…
1 of 5 tasks
add shared experts for upcoming Granite 4.0 language models
#35894
opened Jan 26, 2025 by
mayank31398
Loading…
Fix synced multi-GPU generation with LLMs and VLMs
#35893
opened Jan 26, 2025 by
ManukyanD
Loading…
1 of 5 tasks
Iterative generation using Input embeds and static cache
#35890
opened Jan 25, 2025 by
yaswanth19
Loading…
1 of 5 tasks
Fix typing in audio_utils.chroma_filter_bank
#35888
opened Jan 25, 2025 by
CalOmnie
Loading…
1 of 5 tasks
Add security validation for ZeroShotClassificationArgumentHandler hypothesis templates #35874
#35886
opened Jan 25, 2025 by
sambhavnoobcoder
Loading…
Fix XGLM loss computation (PyTorch and TensorFlow)
#35878
opened Jan 24, 2025 by
damianoamatruda
Loading…
Fix lost loss values when using user-defined compute_loss_func in some cases
#35872
opened Jan 24, 2025 by
dolphin-Dang
Loading…
Add default TP plan for all models with backend support
#35870
opened Jan 24, 2025 by
Cyrilvallez
Loading…
[docs] no hard coding cuda as bnb has multi-backend support
#35867
opened Jan 24, 2025 by
faaany
Loading…
Fix device mismatch error in Whisper model during feature extraction
#35866
opened Jan 24, 2025 by
thedebugger
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.