From 29a21844ab1d2b280eeb7b165469ab06446ff284 Mon Sep 17 00:00:00 2001
From: PENG Bo <33809201+BlinkDL@users.noreply.github.com>
Date: Fri, 13 May 2022 04:00:16 +0800
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 40a1634..aa461db 100644
--- a/README.md
+++ b/README.md
@@ -8,7 +8,7 @@ So it's combining the best of RNN and transformer - great performance, fast infe
 
 Reddit discussion: https://www.reddit.com/r/MachineLearning/comments/umq908/r_rwkvv2rnn_a_parallelizable_rnn_with/
 
-I am training it on the Pile (https://github.com/BlinkDL/RWKV-v2-RNN-Pile) and it might reach GPT-Neo performance within 100B tokens:
+I am training it on the Pile (https://github.com/BlinkDL/RWKV-v2-RNN-Pile) and it shall be able to reach GPT-Neo performance on most tasks within 100B tokens:
 
 ![RWKV-v2-430M-Pile](RWKV-v2-430M-Pile.png)
 