PENG Bo
|
e65448716d
|
Update README.md
|
4 years ago |
PENG Bo
|
fffbcb6785
|
Update README.md
|
4 years ago |
PENG Bo
|
2951b2895a
|
Update README.md
|
4 years ago |
PENG Bo
|
e541d93b97
|
Update README.md
|
4 years ago |
PENG Bo
|
6b1ba8a9bd
|
Update README.md
|
4 years ago |
PENG Bo
|
619c8add7a
|
Update README.md
|
4 years ago |
PENG Bo
|
438377af6d
|
Add files via upload
+results
|
4 years ago |
PENG Bo
|
9cf873ebb2
|
Update README.md
|
4 years ago |
PENG Bo
|
af56c2446d
|
Update README.md
|
4 years ago |
PENG Bo
|
ecaf1f98aa
|
Update README.md
|
4 years ago |
PENG Bo
|
e2f3465fe6
|
Update README.md
|
4 years ago |
PENG Bo
|
f11b97ae48
|
Update README.md
|
4 years ago |
BlinkDL
|
88e921bf10
|
+eval code for 27M ppl 1.65 BPC 0.72 enwik8 model
|
4 years ago |
BlinkDL
|
71538e44a9
|
refactoring
|
4 years ago |
BlinkDL
|
5817d265c3
|
no message
|
4 years ago |
BlinkDL
|
0b6aec3da6
|
Default parameters for 8G VRAM
|
4 years ago |
PENG Bo
|
6aefe59c3d
|
Update README.md
|
4 years ago |
PENG Bo
|
da6f35f276
|
Update README.md
|
4 years ago |
PENG Bo
|
72a6f28add
|
Update README.md
|
4 years ago |
PENG Bo
|
d8234047e6
|
Update README.md
|
4 years ago |
PENG Bo
|
5ac82f37be
|
Update README.md
|
4 years ago |
PENG Bo
|
a86ac324de
|
Update README.md
|
4 years ago |
PENG Bo
|
9ca62d3a1e
|
Update README.md
|
4 years ago |
PENG Bo
|
4b1df60e94
|
Update README.md
|
4 years ago |
PENG Bo
|
780bed4e19
|
Update README.md
|
4 years ago |
BlinkDL
|
5f21ddf20d
|
RWKV v2 RNN is here. Probably the strongest LM as of now.
|
4 years ago |
PENG Bo
|
1f189a4034
|
Update README.md
|
4 years ago |
PENG Bo
|
5ac265691f
|
RWKV-v2-RNN introduction
|
4 years ago |
PENG Bo
|
b4fd1a7209
|
Update model.py
|
4 years ago |
PENG Bo
|
c8a751ed8b
|
Update README.md
|
4 years ago |
BlinkDL
|
0a0eae447d
|
+headQK (compatible with 2022-02-15 AI-Writer)
|
4 years ago |
BlinkDL
|
b48aa1d430
|
no message
|
4 years ago |
BlinkDL
|
a19be54bf5
|
no message
|
4 years ago |
BlinkDL
|
fcd01f8851
|
no message
|
4 years ago |
BlinkDL
|
76e241b71e
|
saves vocab.json, and the model every X epoch
|
4 years ago |
PENG Bo
|
689a6a924d
|
Update train.py
|
4 years ago |
PENG Bo
|
34fa2ec81b
|
Update README.md
|
4 years ago |
PENG Bo
|
58bdb908f9
|
Update README.md
|
4 years ago |
PENG Bo
|
3d8d0373b4
|
Update README.md
|
4 years ago |
BlinkDL
|
710d3e34b7
|
better init for RWKV
|
4 years ago |
BlinkDL
|
619ed00e4b
|
misc improvement
|
4 years ago |
PENG Bo
|
a36fc09fea
|
Update README.md
|
4 years ago |
PENG Bo
|
a91084efa9
|
Update README.md
|
4 years ago |
BlinkDL
|
3329161ed7
|
rapid convergence using ZERO initialization
|
4 years ago |
BlinkDL
|
7f391c5758
|
+ RWKV tiny-attn and now it's great for ctx 1024 or 2048
|
4 years ago |
PENG Bo
|
a9f39c112c
|
Update README.md
|
4 years ago |
PENG Bo
|
8fd4601dea
|
Update README.md
|
4 years ago |
BlinkDL
|
9b903db103
|
Merge branch 'main' of https://github.com/BlinkDL/RWKV-LM into main
|
4 years ago |
BlinkDL
|
8aec414db2
|
no message
|
4 years ago |
PENG Bo
|
9e959d0b8a
|
Update README.md
|
4 years ago |