Commit 926b7aa
committed
Accelerate Utilities (vllm-project#193)
* wip
* add modify_offload_module
* update docs
* WIP
* cleanup functions, begin depreciation
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* remove extra space
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* revert get_offloaded_device
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* update to align_module_device
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* add requires skip for accelerate
* fix per token initialization
* remove align_module_device
* respond to nits
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* Accelerate Utilities Follow-up (vllm-project#224)
* rename
* implement recursive case
* remove print
* support OffloadedWeightsLoader
* add lifecycle docstring
* implement offload_to_weights_map with recursive definition
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* add docstring
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* fix type hint
* add check_accelerate guard
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* make device used by clearer
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* update update_prefix_dict
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* reuse fixture
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* use apply rather than recursion
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
* clearer delete_from_weights_map
* add offload_device argument (vllm-project#228)
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
---------
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>1 parent 9236489 commit 926b7aa
File tree
5 files changed
+708
-91
lines changed- src/compressed_tensors
- quantization/lifecycle
- utils
- tests
- test_quantization/lifecycle
- test_utils
5 files changed
+708
-91
lines changedLines changed: 17 additions & 44 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | | - | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
33 | 37 | | |
34 | 38 | | |
35 | 39 | | |
| |||
112 | 116 | | |
113 | 117 | | |
114 | 118 | | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
136 | | - | |
137 | | - | |
138 | | - | |
139 | | - | |
140 | | - | |
141 | | - | |
142 | | - | |
143 | | - | |
144 | | - | |
145 | | - | |
146 | | - | |
147 | | - | |
148 | | - | |
149 | | - | |
150 | | - | |
151 | | - | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
152 | 123 | | |
153 | 124 | | |
154 | 125 | | |
| |||
169 | 140 | | |
170 | 141 | | |
171 | 142 | | |
172 | | - | |
173 | | - | |
174 | | - | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
175 | 148 | | |
176 | 149 | | |
177 | 150 | | |
| |||
196 | 169 | | |
197 | 170 | | |
198 | 171 | | |
199 | | - | |
| 172 | + | |
200 | 173 | | |
201 | 174 | | |
202 | 175 | | |
203 | 176 | | |
204 | 177 | | |
205 | 178 | | |
206 | 179 | | |
207 | | - | |
| 180 | + | |
208 | 181 | | |
209 | 182 | | |
210 | 183 | | |
| |||
214 | 187 | | |
215 | 188 | | |
216 | 189 | | |
217 | | - | |
| 190 | + | |
218 | 191 | | |
219 | 192 | | |
220 | 193 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
16 | 18 | | |
17 | 19 | | |
18 | 20 | | |
| |||
24 | 26 | | |
25 | 27 | | |
26 | 28 | | |
| 29 | + | |
| 30 | + | |
27 | 31 | | |
28 | 32 | | |
29 | 33 | | |
| |||
122 | 126 | | |
123 | 127 | | |
124 | 128 | | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
125 | 188 | | |
126 | 189 | | |
127 | 190 | | |
| |||
0 commit comments