Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Request] Please update rwkv-raven-{1b5, 3b, 7b}-q8f16_0 to _1 #26

Open
MrCsabaToth opened this issue Sep 6, 2023 · 2 comments
Open

Comments

@MrCsabaToth
Copy link

As far as I understood the latest APK requires _1 models? I'd like to try the RWKV, because it didn't work with the original apk. What's the difference between the _0 and _1 versions, are they incompatible?

@MrCsabaToth MrCsabaToth changed the title [Request]: Please update rwkv-raven-{1b5, 3b, 7b}-q8f16_0 to _1 [Request] Please update rwkv-raven-{1b5, 3b, 7b}-q8f16_0 to _1 Sep 6, 2023
@David-Sharma
Copy link
Contributor

David-Sharma commented Sep 7, 2023

I do not believe that there is a difference between the names. As far as I know q8f16 refers to the following q8 - quantization (8 bits) and f16 is Floating Point 16-bit precision. The _1 or _0 refers to the layout.

@GameOverFlowChart
Copy link

GameOverFlowChart commented Oct 2, 2023

Rwkv-5 world is being trained right now, so I would prefer that we wait until it's finished and that that one gets added to Android. Until now no apk has the lib to run raven as it seems like and if we are so late already it really makes sense to wait for rwkv 5.

Also _0 runs faster at least on some hardware.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants