Skip to content

Conversation

@dkun7944
Copy link

@dkun7944 dkun7944 commented Jun 4, 2023

#1493

Added a tensor layer number parameter to ggml_tensor. 0 is used for global tensors

@ggerganov
Copy link
Member

See my latest comment: #1493 (comment)

@mofosyne mofosyne added enhancement New feature or request Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level labels May 10, 2024
@mofosyne
Copy link
Collaborator

obsolete?

@slaren slaren closed this May 10, 2024
wbruna added a commit to wbruna/llama.cpp that referenced this pull request Aug 22, 2025
* Support q8_0 quantization for image model loading

q4_0 may degrade quality significantly, especially for smaller
models like SD 1.5 and SDXL. q8_0 provides a middle-ground,
giving half the memory savings of q4_0 but loading faster and
with less quality loss.

* Accept --sdquant with no parameters

* Use numerical values for the sdquant option
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants