Skip to content

Pull requests: turboderp-org/exllamav2

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix ext.py silently ignoring ImportErrors
#774 opened Apr 24, 2025 by ThisIsPIRI Loading…
Llama-3_1-Nemotron 51B support
#726 opened Jan 28, 2025 by ymcki Loading…
Add ExLlamaV2Sampler.Settings.logits_processor
#634 opened Sep 23, 2024 by lapp0 Loading…
added option for tokenized input to dynamic generator
#613 opened Sep 2, 2024 by KT313 Loading…
Adding stream to 1 kernel.
#590 opened Aug 14, 2024 by Narsil Loading…
Simple QuaRot proof of concept.
#407 opened Apr 11, 2024 by sgsdxzy Loading…
Refactor token healing initialization.
#330 opened Feb 10, 2024 by bjj Loading…
Repeat layers to create FrankenModels
#275 opened Jan 12, 2024 by dnhkng Loading…
add QuiP quant support
#217 opened Dec 7, 2023 by waters222 Loading…
Adding return_lowest_perplexity
#206 opened Dec 3, 2023 by ziadloo Loading…
Add copilot server example
#23 opened Sep 13, 2023 by chenhunghan Loading…
ProTip! no:milestone will show everything without a milestone.