New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
[QuantNoise] QuantNoise convertion for custom / own model outside of fairseq?
needs triage
question
#2236
opened Jun 11, 2020 by
Oskop
gelu_accurate: nan gradient with fp16. add an inner cast to float32?
bug
needs triage
#2235
opened Jun 11, 2020 by
vadimkantorov
"--print-alignment" argument drastically slows down the generation
bug
needs triage
#2234
opened Jun 11, 2020 by
ladler0320
Help with using Pointer-Generator for abstractive summarization
needs triage
question
#2233
opened Jun 10, 2020 by
griff4692
Problem with training transformer translation model with quant-noise-pq
needs triage
question
#2232
opened Jun 9, 2020 by
jahutwb
checkpoint saving skipped at the end of an epoch
bug
needs triage
#2230
opened Jun 9, 2020 by
jesgim
[BUG] AttributeError: 'float' object has no attribute 'itemsize' in indexed_dataset.py
#2221
opened Jun 8, 2020 by
memeda
BART Abstractive Summarization : What is the maximum number of input tokens ?
needs triage
question
#2220
opened Jun 7, 2020 by
shamanez
Extract embedding from NMT, apply some transformation, use it back to finetune
needs triage
question
#2219
opened Jun 7, 2020 by
nishaahmed
Size Mismatch error when training translation_from_pretrained_xlm
needs triage
question
#2218
opened Jun 7, 2020 by
ajesujoba
[Translation MOE] Different Model Performance with Paper
question
#2212
opened Jun 5, 2020 by
juheeuu
generate for Glue score JFLEG data fails
needs triage
question
#2211
opened Jun 4, 2020 by
NikhilCherian
Is the first target token ignored? (`translation` task w/ `bart.base` arch)
needs triage
question
#2209
opened Jun 4, 2020 by
pltrdy
Improving levt transformer training speed
enhancement
help wanted
needs triage
#2208
opened Jun 4, 2020 by
mingruimingrui
quant_noise.py, process for kernel_size==(1,1) change input weight size ,cause error
bug
needs triage
#2207
opened Jun 4, 2020 by
misslibra
Possibly wrong link of roberta base model with layerdrop ?
documentation
needs triage
#2206
opened Jun 4, 2020 by
zhuango
ReduceLROnPlateau does not support `maximize-best-checkpoint-metric`
bug
#2205
opened Jun 2, 2020 by
pietruh
Previous Next
ProTip!
Follow long discussions with comments:>50.