You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
RWKV-LM/RWKV-v4
BlinkDL 083f9504c6 + bf16 mode (more stable) 3 years ago
..
cuda RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago
src + bf16 mode (more stable) 3 years ago
deepspeed.json RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago
run.py + bf16 mode (more stable) 3 years ago
train.py + bf16 mode (more stable) 3 years ago
verify.py + bf16 mode (more stable) 3 years ago