Issues: NVIDIA/FasterTransformer
Issues list
GPU Kernel performance test with empty parameter without model load
bug
#566 opened Apr 18, 2023 by 92hyungjun
Converting Flan-UL2 does not produce encoder files
bug
#565 opened Apr 17, 2023 by anshoomehra
Some examples do not appear to use ALIBI bias for models like BLOOM
#558 opened Apr 14, 2023 by abhi-mosaic
[5.3.0] T5 model under FP16 is generating garbage
bug
#554 opened Apr 12, 2023 by lanking520
The memory allocation for class Allocator<AllocatorType::CUDA> is very slow.
bug
#547 opened Apr 7, 2023 by hongqing1986
opt smoothquant model got runtime error when int8_mode=2
bug
#543 opened Apr 4, 2023 by fishelegs
Could NOT find MPI_CXX (missing: MPI_CXX_LIB_NAMES MPI_CXX_HEADER_DIR MPI_CXX_WORKS)
bug
#542 opened Apr 4, 2023 by amazingkmy
GPT-NeoX HuggingFace Converter does not work
bug
#540 opened Apr 3, 2023 by ankit-db
sampling doesn't stop after words in stop_words_list are generated
bug
#528 opened Mar 28, 2023 by shijie-wu
The FT t5 beam search algorithm generates inconsistent results with HF's
#522 opened Mar 25, 2023 by shiqingzhangCSU