[Question] Does MLC support speculative decoding on Android? #1858

calebtung · 2024-02-28T19:37:15Z

I saw speculative decoding mentioned in #1539 . Is this supported for deployment to Android targets?
If I wanted to deploy Llama2-7b with TinyLlama as a draft model, how would I do that?
Would also be happy to contribute to the docs on this.

Thanks.

YiGe-MediaTek · 2024-03-15T06:19:31Z

I am also interested to see the speculative decoding support on Android platform. Any answer on that?

tqchen · 2024-05-11T02:56:47Z

spec decode is coming to MLCEngine #2217 which is being landed soon

calebtung added the question Question about the usage label Feb 28, 2024

tqchen closed this as completed May 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Does MLC support speculative decoding on Android? #1858

[Question] Does MLC support speculative decoding on Android? #1858

calebtung commented Feb 28, 2024

YiGe-MediaTek commented Mar 15, 2024

tqchen commented May 11, 2024

[Question] Does MLC support speculative decoding on Android? #1858

[Question] Does MLC support speculative decoding on Android? #1858

Comments

calebtung commented Feb 28, 2024

YiGe-MediaTek commented Mar 15, 2024

tqchen commented May 11, 2024