-
Notifications
You must be signed in to change notification settings - Fork 27.7k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix XGLM loss computation (PyTorch and TensorFlow)
#35878
opened Jan 24, 2025 by
damianoamatruda
Loading鈥�
Fix lost loss values when using user-defined compute_loss_func in some cases
#35872
opened Jan 24, 2025 by
dolphin-Dang
Loading鈥�
Add default TP plan for all models with backend support
#35870
opened Jan 24, 2025 by
Cyrilvallez
Loading鈥�
[docs] no hard coding cuda as bnb has multi-backend support
#35867
opened Jan 24, 2025 by
faaany
Loading鈥�
Fix device mismatch error in Whisper model during feature extraction
#35866
opened Jan 24, 2025 by
thedebugger
Loading鈥�
Fix PaliGemma Pad Token Masking During Training #35855
#35859
opened Jan 23, 2025 by
sambhavnoobcoder
Loading鈥�
Add utility for Reload Transformers imports cache for development workflow #35508
#35858
opened Jan 23, 2025 by
sambhavnoobcoder
Loading鈥�
馃毃馃毃馃毃 image-classification pipeline single-label and multi-label prob type squashing fns (sigmoid vs softmax) are backwards
bug
Core: Pipeline
Internals of the library; Pipeline.
Vision
#35848
opened Jan 22, 2025 by
rwightman
Loading鈥�
Fix Jitter Noise Passing to Experts in Switch Transformers #33969
#35847
opened Jan 22, 2025 by
sambhavnoobcoder
Loading鈥�
Nail in edge case of torch dtype being overriden permantly in the case of an error
#35845
opened Jan 22, 2025 by
muellerzr
Loading鈥�
1 of 5 tasks
Optimize Qwen2VL vision model by precomputing cos/sin embeds before ViT blocks
Multimodal
optimization
#35837
opened Jan 22, 2025 by
li-plus
Loading鈥�
1 of 5 tasks
fix(FA): QKV not being casted to target_dtype for FA with dpo lora
#35834
opened Jan 22, 2025 by
NanoCode012
Loading鈥�
1 of 5 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.