-
Notifications
You must be signed in to change notification settings - Fork 134
Issues: triton-inference-server/fastertransformer_backend
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Milestones
Assignee
Sort
Issues list
Whether fastertransformer supports gpt-2 classification model, such as GPT2ForSequenceClassification?
#171
opened Oct 19, 2023 by
cabbagetalk
No response is received during inference in decoupled mode.
bug
Something isn't working
#169
opened Sep 26, 2023 by
amazingkmy
what is the use of preprocessing & postprossing ? can i start fastertransformer only for bloom model ?
bug
Something isn't working
#168
opened Sep 22, 2023 by
flyingjohn
How to deploy multiple model in a node with multople GPUs
bug
Something isn't working
#165
opened Sep 14, 2023 by
jjjjohnson
Memory usage is doubled when loading a fp16 model into bf16
bug
Something isn't working
#164
opened Sep 6, 2023 by
skyser2003
Throughput (requests per second / RPS) not increasing when scaling up from 1 GPU to 4 GPUs
#163
opened Aug 22, 2023 by
chunyat
Can i stop execution? (w/ Something isn't working
decoupled mode
)
bug
#162
opened Aug 21, 2023 by
Yeom
Do I need to specify ARG SM=80 when building the image manually?
#161
opened Aug 15, 2023 by
sfc-gh-zhwang
huggingface_bert_convert.py can't convert some key
bug
Something isn't working
#152
opened Jul 3, 2023 by
SeungjaeLim
Poll failed for model directory 'ensemble': output 'OUTPUT_0' for ensemble 'ensemble' is not written
#144
opened Jun 13, 2023 by
songkq
Why is it needed to set max_batch_size to 1 under interactive mode?
#143
opened Jun 12, 2023 by
zhypku
Why processing requests of batch size=1 is much slower than batch size>1
#142
opened Jun 8, 2023 by
mapcan
FasterTransformer Backend fails to build using latest version of Triton Server
bug
Something isn't working
#140
opened Jun 2, 2023 by
mshuffett
triton support using factertransfer backend for flan-ul2 and flan-ul2-alpaca-lora
#138
opened May 27, 2023 by
ma-siddiqui
Feature request: Conversion from GPTBigCodeForCausalLM / Starcoder
#132
opened May 19, 2023 by
michaelfeil
Previous Next
ProTip!
Adding no:label will show everything without a label.