-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Pull requests: triton-inference-server/server
Author
Label
Milestones
Reviews
Assignee
Sort
Pull requests list
ci: Fix flaky L0_batch related tests
PR: ci
Changes to our CI configuration files and scripts
#7999
opened Feb 10, 2025 by
yinggeh
Loading…
6 of 11 tasks
fix:
build-secret
flag not being set breaking build.py
#7993
opened Feb 6, 2025 by
BenjaminBraunDev
Loading…
draft: feat: Add graceful shutdown timer to GRPC frontend
enhancement
New feature or request
grpc
Related to the GRPC server
#7969
opened Jan 27, 2025 by
mattwittwer
•
Draft
5 of 20 tasks
Separate model generation for backends on blackwell clusters
#7966
opened Jan 24, 2025 by
pvijayakrish
Loading…
3 of 20 tasks
docs: update to fix autoscaling example command
#7883
opened Dec 16, 2024 by
mattwittwer
•
Draft
20 tasks
refactor: Update the response queue in the server to reuse response slots
#7879
opened Dec 13, 2024 by
pskiran1
Loading…
5 of 20 tasks
feat: ORCA Format KV Cache Utilization in Inference Response Header
#7839
opened Nov 27, 2024 by
BenjaminBraunDev
Loading…
12 of 22 tasks
refactor: Refactor of L0_backend_python and the env subtest
PR: ci
Changes to our CI configuration files and scripts
PR: refactor
A code change that neither fixes a bug nor adds a feature
#7838
opened Nov 27, 2024 by
nv-kmcgill53
•
Draft
5 of 20 tasks
ci: Enables testing for pull requests
#7828
opened Nov 23, 2024 by
pranavm-nvidia
Loading…
3 of 20 tasks
test: Updates L0 Python API tests to run all test files
#7827
opened Nov 23, 2024 by
pranavm-nvidia
Loading…
4 of 20 tasks
fix: Default max tokens to None for OpenAI frontend.
#7819
opened Nov 20, 2024 by
thealmightygrant
Loading…
4 of 22 tasks
feat: Adding RestrictedFeatures Support to the Python Frontend Bindings
#7775
opened Nov 8, 2024 by
KrishnanPrash
Loading…
docs: Add clarification for label_filename in classification docs
#7766
opened Nov 5, 2024 by
trevoryao
Loading…
7 of 22 tasks
docs: Simplify PR templates
PR: docs
Documentation only changes
#7753
opened Oct 29, 2024 by
yinggeh
Loading…
6 of 11 tasks
[Do not merge!] Build: Remove TRT model generation for V100
#7712
opened Oct 16, 2024 by
pvijayakrish
•
Draft
3 of 20 tasks
fix:Split L0_nomodel_perf into 2 test to ensure better debug-ability and resource util for PA
#7705
opened Oct 15, 2024 by
indrajit96
•
Draft
6 of 19 tasks
test: TC for Metric P0 nv_load_time per model
#7697
opened Oct 14, 2024 by
indrajit96
Loading…
8 of 20 tasks
Build: Update TRT release branch referenced in model gen file
#7693
opened Oct 11, 2024 by
pvijayakrish
Loading…
3 of 20 tasks
Build: Update README and versions for 24.10
#7686
opened Oct 8, 2024 by
pvijayakrish
Loading…
3 of 20 tasks
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.