8 Commits (2bddd576cd6997d0d9ab447cfd24dcd498d1d067)

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| BlinkDL | 68c486ad10 | supports RWKV-4 pile models | 4 years ago |
| BlinkDL | 61b7c429df | no message | 4 years ago |
| BlinkDL | 7cdc8d3164 | no message | 4 years ago |
| BlinkDL | 13bb641007 | no message | 4 years ago |
| BlinkDL | f79137b524 | supports megatron bin+idx format | 4 years ago |
| BlinkDL | 083f9504c6 | + bf16 mode (more stable) | 4 years ago |
| BlinkDL | 6667ad18c2 | more training tips | 4 years ago |
| BlinkDL | 165dfd1b9e | RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel | 4 years ago |