RWKV-LM/RWKV-v4
BlinkDL 6667ad18c2 more training tips 3 years ago
cuda            RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel  3 years ago
src             more training tips                                 3 years ago
deepspeed.json  RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel  3 years ago
run.py          RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel  3 years ago
train.py        more training tips                                 3 years ago
verify.py       RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel  3 years ago