You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
RWKV-LM/RWKV-v4
BlinkDL 73b96705d7 + fp32 mode (slow but good for verification) 3 years ago
..
cuda RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago
src + fp32 mode (slow but good for verification) 3 years ago
20B_tokenizer.json supports RWKV-4 pile models 3 years ago
run.py supports RWKV-4 pile models 3 years ago
train.py + fp32 mode (slow but good for verification) 3 years ago
verify.py supports RWKV-4 pile models 3 years ago