HuggingFaceModel #21
    logger.info(f"SHA256 hash of the file: {sha256_value}")
    return upload_id, sha256_value

def upload_fs_to_s3(
This duplicates a lot of upload_to_s3. The two need to be consolidated.
The upload_to_s3 method is currently unused for Hugging Face models, because it is designed specifically to transfer data from an HTTP stream (generated by requests) to an S3 bucket.
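One possible way to consolidate the two paths, sketched under assumptions: both an HTTP stream and a local file can be treated as an iterable of byte chunks, so the hashing and part-numbering logic could live in one helper. The names `iter_file_chunks`, `upload_chunks`, and `send_part` are hypothetical and not taken from this PR; `send_part` stands in for whatever call actually uploads a part to S3.

```python
import hashlib
from typing import Callable, Iterable, Iterator


def iter_file_chunks(path: str, chunk_size: int = 8 * 1024 * 1024) -> Iterator[bytes]:
    """Yield a local file as fixed-size byte chunks (hypothetical helper)."""
    with open(path, "rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                break
            yield chunk


def upload_chunks(chunks: Iterable[bytes], send_part: Callable[[int, bytes], None]) -> str:
    """Send every chunk through send_part and return the SHA256 hex digest.

    send_part is an assumed callback (part number, data); in real code it
    would wrap the S3 upload call, which is not shown here.
    """
    digest = hashlib.sha256()
    for part_number, chunk in enumerate(chunks, start=1):
        digest.update(chunk)
        send_part(part_number, chunk)
    return digest.hexdigest()
```

With this shape, the filesystem path would pass `iter_file_chunks(path)` and the HTTP path would pass the chunk iterator from `requests` (for example `response.iter_content(chunk_size=...)`), so only the chunk source differs.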
Example schema:
HuggingFaceModel {
repo_id: "TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ",
files: ["*.json", "model.safetensors"],
inference_devices: ["cpu"], // gpu, tpu, etc
quantization: "GPTQ", // GPTQ, AWQ, GGUF_Q4_0, etc
runtime: "llama.cpp", // vLLM, pytorch, etc
prompt_template: "chatml", // chatml, llama-2, gemma, etc.
}
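As a rough illustration, the schema above could map onto a Python dataclass like the one below. This is a sketch, not the class from this PR: the field defaults and the `matches` helper are assumptions, and the glob semantics come from the standard library's `fnmatch`, which may differ from how the PR actually filters files.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatch
from typing import List


@dataclass
class HuggingFaceModel:
    repo_id: str                                   # e.g. "TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ"
    files: List[str] = field(default_factory=list)  # exact names or glob patterns
    inference_devices: List[str] = field(default_factory=lambda: ["cpu"])
    quantization: str = ""                         # e.g. "GPTQ", "AWQ", "GGUF_Q4_0"
    runtime: str = ""                              # e.g. "llama.cpp", "vLLM", "pytorch"
    prompt_template: str = ""                      # e.g. "chatml", "llama-2", "gemma"

    def matches(self, filename: str) -> bool:
        """Return True if filename matches any entry in files (assumed helper)."""
        return any(fnmatch(filename, pattern) for pattern in self.files)
```

For the example schema, `matches("config.json")` and `matches("model.safetensors")` would select those files, while other names in the repository would be skipped.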
I have added a list named SUPPORTED_MODELS_V2 to supported_model.py.
If the model name is in SUPPORTED_MODELS_V2, the new HuggingFaceModel is used for downloading. Otherwise, the old logic is used.
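The selection described here amounts to a membership check. A minimal sketch, assuming `SUPPORTED_MODELS_V2` holds plain model-name strings; the list contents, `select_downloader`, and the two downloader arguments are illustrative placeholders rather than code from this PR:

```python
from typing import Callable, Sequence

# Assumed contents, for illustration only; the real list lives in supported_model.py.
SUPPORTED_MODELS_V2 = ["TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ"]


def select_downloader(
    model_name: str,
    new_downloader: Callable[[str], str],
    legacy_downloader: Callable[[str], str],
    supported: Sequence[str] = SUPPORTED_MODELS_V2,
) -> Callable[[str], str]:
    """Pick the HuggingFaceModel-based path for listed models, else the old logic."""
    if model_name in supported:
        return new_downloader
    return legacy_downloader
```

Keeping the fallback explicit means models not yet listed keep working unchanged while entries are migrated one at a time.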