$$\text{out}_t \;=\; \text{sigmoid}(R_t)\cdot\sum_{u} W_{u,t}\cdot K_u\cdot V_u$$
* Here R, K, V are generated by linear transforms of the input, and W is a learned parameter. Basically, RWKV decomposes attention into R(target) * W(src, target) * K(src). So we can call R "receptance", and the sigmoid means it is in the 0~1 range.
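
To make the decomposition concrete, here is a minimal PyTorch sketch of the idea. It assumes a batch-first (B, T, C) input, a causal lower-triangular mask, and elementwise per-channel mixing; the class and parameter names (`RWKVMixSketch`, `time_w`) and the toy initialization are illustrative assumptions, not the actual layer in this repo.

```python
import torch
import torch.nn as nn

class RWKVMixSketch(nn.Module):
    # out(t) = sigmoid(R_t) * sum_u W[u, t] * K_u * V_u, elementwise per channel
    def __init__(self, ctx_len: int, n_embd: int):
        super().__init__()
        # R, K, V are linear transforms of the input
        self.receptance = nn.Linear(n_embd, n_embd, bias=False)
        self.key = nn.Linear(n_embd, n_embd, bias=False)
        self.value = nn.Linear(n_embd, n_embd, bias=False)
        # W is a learned weight for every (src, target) pair (toy init)
        self.time_w = nn.Parameter(torch.randn(ctx_len, ctx_len) / ctx_len)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, C = x.shape
        r = torch.sigmoid(self.receptance(x))     # R(target), squashed into 0~1
        kv = self.key(x) * self.value(x)          # K(src) * V(src), elementwise
        w = torch.tril(self.time_w[:T, :T])       # causal: a target only mixes src <= target
        mix = torch.einsum('ts,bsc->btc', w, kv)  # sum over src for each target
        return r * mix                            # gate the mixed values with receptance

layer = RWKVMixSketch(ctx_len=128, n_embd=64)
y = layer(torch.randn(2, 16, 64))                 # -> shape (2, 16, 64)
```

Because sigmoid(R) stays in 0~1, it acts as a per-channel gate on the mixed values, which is why R is called "receptance": it controls how much of the mixture each target position accepts.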