Logo
Explore Help
Sign In
novarobot
/
RWKV-LM
1
0
Fork
You've already forked RWKV-LM
0
Code Issues Pull Requests Packages Projects Releases Wiki Activity
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
main
0.01
0.02
2.00
4.00
Branches Tags
${ item.name }
Create tag ${ searchTerm }
Create branch ${ searchTerm }
from 'b6b5f4628f'
${ noResults }
RWKV-LM/RWKV-v3
History
BlinkDL b6b5f4628f tips for training, with exponential lr decay 4 years ago
..
cuda RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago
src tips for training, with exponential lr decay 4 years ago
run.py RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago
train.py tips for training, with exponential lr decay 4 years ago
verify.py RWKV-3 (test deeper models (n_layer >= 12) to see the advantage) 4 years ago
Powered by Forgejo Version: 1.19.3-0 Page: 48ms Template: 3ms
English
Bahasa Indonesia Deutsch English Español Français Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API