Releases · FasterDecoding/Medusa

11 Sep 20:35

10e7387

Medusa-v0.1 Latest

Latest

Medusa is a easy-to-use framework that democratizes the acceleration techniques for LLM generation. Medusa-v0.1 uses several extra light-weighted decoding head, and exclude the need for draft model.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: FasterDecoding/Medusa

Medusa-v0.1