Web-RWKV-Py

Python binding for web-rwkv.

Todos

Basic V5 inference support
Support V4, V5 and V6
Batched inference

Usage

Install python and rust.
Install maturin by
```
$ pip install maturin
```
Build and install:
```
$ maturin develop --release
```

Try using web-rwkv in python:

import web_rwkv_py as wrp

model = wrp.Model(
   "/path/to/model.st", # model path
   quant=0,             # int8 quantization layers
   quant_nf4=0,         # nf4 quantization layers
)
model.clear_state()
logits = model.run([114, 514])

Advanced Usage

Get, clone and load current state:

logits = model.run([114, 514])
state = model.back_state(wrp.StateDevice.Gpu)
# state = model.back_state(wrp.StateDevice.Cpu)
state_cloned = state.deep_clone()

model.load_state(state_cloned)
logits = model.run([1919, 810])

Return predictions of all tokens (not only the last's):

logits, state = model.run_full([114, 514, 1919, 810], state=None)
assert(len(logits) == 4)

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
assets		assets
examples		examples
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web-RWKV-Py

Todos

Usage

Advanced Usage

About

Releases

Packages

Contributors 2

Languages

cryscan/web-rwkv-py

Folders and files

Latest commit

History

Repository files navigation

Web-RWKV-Py

Todos

Usage

Advanced Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages