RWKV-LM/RWKV-v4
BlinkDL 7cdc8d3164 no message 3 years ago
cuda RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago
src no message 3 years ago
deepspeed.json RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago
run.py supports megatron bin+idx format 3 years ago
train.py no message 3 years ago
verify.py + bf16 mode (more stable) 3 years ago