Issues: huggingface/transformers
Issues list
ver 4.35.2 transformers.Trainer breaks CUDA AMP support
#27760
opened Nov 29, 2023 by
haixpham
2 of 4 tasks
ZeroDivisionError when training on a single batch of data
#27758
opened Nov 29, 2023 by
tleyden
2 of 4 tasks
How to inference the model with 200k length context
New model
#27755
opened Nov 29, 2023 by
taishan1994
2 tasks done
Pipeline instantiation of model "facebook/nllb-200-distilled-600M" requires source and target language as mandatory
#27753
opened Nov 28, 2023 by
drunkeninja42
2 of 4 tasks
TECO - Temporally Consistent Transformers for Video Generation
New model
#27752
opened Nov 28, 2023 by
amyeroberts
2 tasks done
HF trainer training args: save_only_model does not work together with load_best_model_at_end when using deepspeed
#27751
opened Nov 28, 2023 by
welsh01
2 of 4 tasks
Learning Rate doesn't anneal properly after resume_from_checkpoint
#27749
opened Nov 28, 2023 by
jmzeng
2 of 4 tasks
Trainer fails when using torchrun for distributed run of transformer model wrapped with PEFT
#27744
opened Nov 28, 2023 by
Ahmed-Roushdy
2 of 4 tasks
RuntimeError: Could not infer dtype of JpegImageFile
#27739
opened Nov 28, 2023 by
realbigi
2 of 4 tasks
How to save the generated output of BarkModel to an npz file?
#27737
opened Nov 28, 2023 by
chet-chen
RuntimeError(s) when attempting multi-GPU fine-tuning of IDEFICS with naive model parallelism
#27736
opened Nov 28, 2023 by
willemsenbram
2 of 4 tasks
ZERO loss while finetuning Llama2 using SFT trainer and the use of collator
#27733
opened Nov 27, 2023 by
Sosycs
Add flag for easily finetuning heads / linear probing to AutoModelforSequenceClassification
#27730
opened Nov 27, 2023 by
0amp
hub_strategy's documentation for checkpoint option is wrong and misleading
#27728
opened Nov 27, 2023 by
omermazig
2 of 4 tasks
Adding support for prompt lookup decoding (variant of assisted generation)
#27722
opened Nov 27, 2023 by
apoorvumang
Batch QuestionAnsweringPipeline prediction with different postprocess_params (e.g. max_answer_len)
#27719
opened Nov 27, 2023 by
KatHaruto
Inquiry about the difference between two Approaches to Mask Infilling using BART Model in the official document
#27711
opened Nov 26, 2023 by
Hyfred
load_in_4bit=True works only with models in safetensors format
#27708
opened Nov 26, 2023 by
danielkorat
2 of 4 tasks
implement TemplateConstraints in class transformers.Constraint
Feature request
#27706
opened Nov 26, 2023 by
MrzEsma