File tree
822 files changed
+59309
-15464
lines changed- .buildkite
- nightly-benchmarks
- scripts
- tests
- scripts
- hardware_ci
- tpu
- .github
- workflows
- benchmarks
- cutlass_benchmarks
- kernels
- deepgemm
- cmake
- external_projects
- csrc
- attention
- mla
- cpu
- sgl-kernels
- cutlass_extensions/gemm/collective
- mamba
- causal_conv1d
- mamba_ssm
- moe
- marlin_moe_wna16
- permute_unpermute_kernels
- prepare_inputs
- quantization
- compressed_tensors
- cutlass_w8a8
- c3x
- moe
- fp4
- fp8/amd
- gguf
- gptq_marlin
- gptq
- machete
- quickreduce
- rocm
- sparse/cutlass
- docker
- docs
- ci
- cli
- community
- configuration
- contributing
- model
- deployment
- frameworks
- integrations
- design
- kernel
- v1
- features
- quantization
- getting_started
- installation
- cpu
- gpu
- mkdocs
- javascript
- stylesheets
- models
- extensions
- hardware_supported_models
- serving
- integrations
- usage
- examples
- offline_inference
- basic
- openai_batch
- profiling_tpu
- online_serving
- disaggregated_serving_p2p_nccl_xpyd
- opentelemetry
- structured_outputs
- others
- lmcache
- requirements
- tests
- async_engine
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- core
- block/e2e
- cuda
- distributed
- engine
- entrypoints
- llm
- openai
- correctness
- kernels
- attention
- core
- mamba
- moe
- quantization
- lora
- model_executor
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- mq_llm_engine
- multi_step
- multimodal
- neuron/1_core
- plugins_tests
- plugins/vllm_add_dummy_platform
- vllm_add_dummy_platform
- quantization
- samplers
- spec_decode/e2e
- standalone_tests
- tokenization
- tool_use
- tools
- v1
- core
- e2e
- engine
- entrypoints/openai
- kv_connector
- nixl_integration
- unit
- sample
- spec_decode
- tpu
- worker
- worker
- worker
- tools
- vllm
- assets
- attention
- backends
- mla
- ops
- utils
- benchmarks
- compilation
- core
- distributed
- device_communicators
- eplb
- kv_transfer
- kv_connector
- v1
- p2p
- kv_pipe
- engine
- entrypoints
- cli
- benchmark
- openai
- tool_parsers
- executor
- inputs
- logging_utils
- lora
- punica_wrapper
- model_executor
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels/mixed_precision
- quark
- schemes
- utils
- model_loader
- models
- multimodal
- platforms
- transformers_utils
- configs
- processors
- triton_utils
- usage
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- pool
- sample
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
822 files changed
+59309
-15464
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
| 14 | + | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
34 | 48 | | |
35 | 49 | | |
36 | 50 | | |
37 | 51 | | |
38 | 52 | | |
39 | 53 | | |
40 | | - | |
| 54 | + | |
41 | 55 | | |
42 | 56 | | |
43 | 57 | | |
| |||
119 | 133 | | |
120 | 134 | | |
121 | 135 | | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
122 | 160 | | |
123 | 161 | | |
124 | 162 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
19 | | - | |
| 19 | + | |
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
| |||
Lines changed: 12 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
7 | | - | |
| 7 | + | |
| 8 | + | |
8 | 9 | | |
9 | 10 | | |
10 | 11 | | |
| |||
14 | 15 | | |
15 | 16 | | |
16 | 17 | | |
17 | | - | |
| 18 | + | |
| 19 | + | |
18 | 20 | | |
19 | 21 | | |
20 | 22 | | |
| |||
25 | 27 | | |
26 | 28 | | |
27 | 29 | | |
28 | | - | |
29 | | - | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
30 | 33 | | |
| 34 | + | |
31 | 35 | | |
32 | 36 | | |
33 | 37 | | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
34 | 42 | | |
35 | 43 | | |
36 | 44 | | |
| |||
Lines changed: 66 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
Lines changed: 53 additions & 10 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
| 6 | + | |
6 | 7 | | |
7 | 8 | | |
8 | 9 | | |
| 10 | + | |
9 | 11 | | |
10 | 12 | | |
11 | 13 | | |
| |||
29 | 31 | | |
30 | 32 | | |
31 | 33 | | |
32 | | - | |
33 | | - | |
34 | | - | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
35 | 37 | | |
36 | | - | |
| 38 | + | |
37 | 39 | | |
38 | 40 | | |
39 | 41 | | |
40 | 42 | | |
41 | 43 | | |
42 | 44 | | |
43 | 45 | | |
44 | | - | |
| 46 | + | |
45 | 47 | | |
46 | | - | |
47 | | - | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
48 | 52 | | |
49 | 53 | | |
50 | 54 | | |
51 | | - | |
52 | | - | |
53 | | - | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
54 | 58 | | |
55 | 59 | | |
56 | 60 | | |
| |||
75 | 79 | | |
76 | 80 | | |
77 | 81 | | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
78 | 96 | | |
79 | 97 | | |
80 | 98 | | |
| |||
155 | 173 | | |
156 | 174 | | |
157 | 175 | | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
158 | 197 | | |
159 | 198 | | |
160 | 199 | | |
| |||
200 | 239 | | |
201 | 240 | | |
202 | 241 | | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
203 | 245 | | |
204 | 246 | | |
205 | 247 | | |
| |||
211 | 253 | | |
212 | 254 | | |
213 | 255 | | |
| 256 | + | |
214 | 257 | | |
215 | 258 | | |
216 | 259 | | |
| |||
0 commit comments