Skip to content
View BodhiHu's full-sized avatar
🌴
bodhicitta
🌴
bodhicitta
  • MooreThreads, AMD, SAP
  • Shanghai

Block or report BodhiHu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 174 31 Updated Dec 27, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,264 1,273 Updated Sep 5, 2024

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,795 154 Updated Sep 23, 2024

The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"

Python 259 21 Updated May 9, 2024

🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python 64 8 Updated Dec 3, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 899 48 Updated Dec 6, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 3,062 176 Updated Jul 19, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,086 149 Updated Aug 1, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,759 2,334 Updated Dec 23, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,714 1,482 Updated Oct 21, 2024

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 7,516 984 Updated Dec 28, 2024

The codes for training sparsity predictor on LLaMA.

Python 16 Updated May 12, 2024

[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…

Python 853 39 Updated Dec 28, 2024

100% Local AGI with LocalAI

Python 421 67 Updated Jun 23, 2024

Port of OpenAI's Whisper model in C/C++

C++ 36,585 3,747 Updated Dec 24, 2024

Python bindings for llama.cpp

Python 8,341 999 Updated Dec 19, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 3,682 325 Updated Dec 25, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,918 508 Updated Nov 20, 2024

Low-bit LLM inference on CPU with lookup table

C++ 633 48 Updated Dec 6, 2024

3D Visualization of an GPT-style LLM

TypeScript 4,203 462 Updated Aug 24, 2024

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 5,914 382 Updated Dec 29, 2024
Python 306 40 Updated Apr 2, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 8,031 418 Updated Sep 6, 2024

🪀 Lobe CLI Toolbox - AI CLI Toolbox, enhancing git commit and i18n workflow efficiency

TypeScript 299 51 Updated Dec 22, 2024

Zero-dependent. A native nodejs screenshots library for Mac、Windows、Linux.

Rust 313 12 Updated Aug 25, 2024

🚀 Screenshots, word marking, OCR, AI, translation software || 截图、划词、文字识别、AI、翻译软件

TypeScript 2,707 157 Updated Dec 15, 2024

LLM inference in C/C++

C++ 69,890 10,091 Updated Dec 29, 2024

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 14,795 1,531 Updated Dec 29, 2024

MemFree - Hybrid AI Search Engine & AI Page Generator

TypeScript 1,135 178 Updated Dec 28, 2024
Next
Showing results