From 8af6289d0c695f6139c80acbbc9341395c401798 Mon Sep 17 00:00:00 2001 From: PENG Bo <33809201+BlinkDL@users.noreply.github.com> Date: Fri, 13 Aug 2021 03:07:13 +0800 Subject: [PATCH] Update README.md --- README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/README.md b/README.md index bdcdf1f..3ad3f8e 100644 --- a/README.md +++ b/README.md @@ -34,6 +34,10 @@ Moreover we multiply the final output of Time-mix layer by γ(t). The reason for *** +p.s. There is a MHA_pro model in this repo with strong performance. Give it a try :) + +*** + We also propose a new sampling method (as in src/utils.py): (1) Find the max probability p_max after softmax.