PENG Bo b6403a8aef RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago
timex_cuda.cu RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago
timex_op.cpp RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago