Issues: huggingface/transformers
[Quick poll] Give your opinion on the future of the Hugging F...
#20706
by LysandreJik
was closed Mar 30, 2023
Closed
3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Images in "Efficient Inference on a Single GPU" don't load
#25428
opened Aug 10, 2023 by
osanseviero
4 tasks
MBartForConditionalGeneration doesn't seem to be able to complete the task of filling mask.
#25425
opened Aug 10, 2023 by
5i-wanna-be-the-666
2 of 4 tasks
Possible Bug with KV Caching in Llama (original) model
#25420
opened Aug 9, 2023 by
maximkha
2 of 4 tasks
Abnormally High GPU Memory Consumption with OPT 350M Model Leading to OOM
#25419
opened Aug 9, 2023 by
ayaka14732
1 of 4 tasks
Longformer model: tf.Tensor as a Python bool is not allowed
#25418
opened Aug 9, 2023 by
rdisipio
2 of 4 tasks
AttributeError: module 'jax.numpy' has no attribute 'DeviceArray' in colab
#25417
opened Aug 9, 2023 by
yundaehyuck
4 tasks
[BUG]
ExponentialDecayLengthPenalty decreases negative scores
#25416
opened Aug 9, 2023 by
pokjay
2 of 4 tasks
Different generations during test time and validation time
#25400
opened Aug 9, 2023 by
karths8
4 tasks
accelerator.save_state() will report error while i use accelerate and fsdp
#25397
opened Aug 9, 2023 by
lplzyp
1 of 4 tasks
Why is generation_config.json has a higher priority ?.
#25395
opened Aug 8, 2023 by
vchagari
4 tasks
Training speed slows down to a half when double batchsize
#25385
opened Aug 8, 2023 by
YTianZHU
2 of 4 tasks
pooler of dino-v2 is newly initialized when loading the pre-trained model
#25377
opened Aug 8, 2023 by
garychan22
2 of 4 tasks
[Bug]
low_cpu_mem_usage=True is not working for LLAMA2-70B
#25369
opened Aug 8, 2023 by
dc3671
2 of 4 tasks
Saving with trainer deepspeed zero3 missing config.json and tokenizer files.
#25368
opened Aug 8, 2023 by
zjjMaiMai
Trainer class: using the Accelerate launcher with Deepspeed
#25356
opened Aug 7, 2023 by
nebrelbug
2 of 4 tasks
Add Flax diverse group search
Flax
Good Second Issue
Issues that are more difficult to do than "Good First" issues - give it a try if you want!
#25355
opened Aug 7, 2023 by
sanchit-gandhi
Can't train and load TFGPT2LMHeadModel from disc
#25350
opened Aug 7, 2023 by
danielricks
2 of 4 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.