Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix premature downcast in LlamaRMSNorm
#25421
opened Aug 9, 2023 by
Birch-san
Loading鈥�
2 of 5 tasks
Generate: Load generation config when
device_map is passed
#25413
opened Aug 9, 2023 by
gante
Loading鈥�
Generation: strict generation config validation at save time
#25411
opened Aug 9, 2023 by
gante
Loading鈥�
Inconsistency in PreTrainedModel.resize_token_embeddings When ZeRO3 Is Enabled
#25394
opened Aug 8, 2023 by
sinamoeini
Loading鈥�
1 of 5 tasks
Fix issue with ratio evaluation steps and auto find batch size
#25390
opened Aug 8, 2023 by
muellerzr
Loading鈥�
1 of 5 tasks
[DOCS] Added docstring example for EpsilonLogitsWarper #24783
#25378
opened Aug 8, 2023 by
sanjeevk-os
Loading鈥�
2 tasks done
add docstring examples to Encoder repetition penalty logits processor
#25317
opened Aug 4, 2023 by
rajveer43
Loading鈥�
5 tasks
Fixed "Dynamic" issue in LlamaDynamicNTKScalingRotaryEmbedding
#25308
opened Aug 4, 2023 by
LetianLee
Loading鈥�
2 of 5 tasks
Fix Llama's attention map handling for left padding which causes numerical instability and performance drops
#25284
opened Aug 3, 2023 by
Randolph-zeng
Loading鈥�
[
Docs / BetterTransformer ] Added more details about flash attention + SDPA
#25265
opened Aug 2, 2023 by
younesbelkada
Loading鈥�
Previous Next
ProTip!
Mix and match filters to narrow down what you鈥檙e looking for.