@@ -61,20 +61,20 @@ th:not(:first-child) {
6161
6262### Feature x Hardware
6363
64- | Feature | Volta | Turing | Ampere | Ada | Hopper | CPU | AMD | TPU |
65- | -----------------------------------------------------------| ---------------------| -----------| -----------| --------| ------------| --------------------| --------| -----|
66- | [ CP] [ chunked-prefill ] | [ ❌] ( gh-issue:2729 ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
67- | [ APC] ( automatic_prefix_caching.md ) | [ ❌] ( gh-issue:3687 ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
68- | [ LoRA] ( lora.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
69- | [ SD] ( spec_decode.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
70- | CUDA graph | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
71- | [ pooling] ( ../models/pooling_models.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
72- | <abbr title =" Encoder-Decoder Models " >enc-dec</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ |
73- | [ mm] ( multimodal_inputs.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
74- | <abbr title =" Logprobs " >logP</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
75- | <abbr title =" Prompt Logprobs " >prmpt logP</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
76- | <abbr title =" Async Output Processing " >async output</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
77- | multi-step | ✅ | ✅ | ✅ | ✅ | ✅ | [ ❌] ( gh-issue:8477 ) | ✅ | ❌ |
78- | best-of | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
79- | beam-search | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
80- | [ prompt-embeds] ( prompt_embeds.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ? | [ ❌] ( gh-issue:25097 ) |
64+ | Feature | Volta | Turing | Ampere | Ada | Hopper | CPU | AMD | TPU | Intel GPU |
65+ | -----------------------------------------------------------| ---------------------| -----------| -----------| --------| ------------| --------------------| --------| -----| ------------ |
66+ | [ CP] [ chunked-prefill ] | [ ❌] ( gh-issue:2729 ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
67+ | [ APC] ( automatic_prefix_caching.md ) | [ ❌] ( gh-issue:3687 ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
68+ | [ LoRA] ( lora.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
69+ | [ SD] ( spec_decode.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | [ 🟠 ] ( gh-issue:26963 ) |
70+ | CUDA graph | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ | [ ❌ ] ( gh-issue:26970 ) |
71+ | [ pooling] ( ../models/pooling_models.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
72+ | <abbr title =" Encoder-Decoder Models " >enc-dec</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ |
73+ | [ mm] ( multimodal_inputs.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | [ 🟠 ] ( gh-issue:26965 ) |
74+ | <abbr title =" Logprobs " >logP</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
75+ | <abbr title =" Prompt Logprobs " >prmpt logP</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
76+ | <abbr title =" Async Output Processing " >async output</abbr > | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ |
77+ | multi-step | ✅ | ✅ | ✅ | ✅ | ✅ | [ ❌] ( gh-issue:8477 ) | ✅ | ❌ | ✅ |
78+ | best-of | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
79+ | beam-search | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
80+ | [ prompt-embeds] ( prompt_embeds.md ) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ? | [ ❌] ( gh-issue:25097 ) | ✅ |
0 commit comments