6 Commits (7cdc8d3164d351cba3cd596dbb0fcc887df1611f)

Author SHA1 Message Date
BlinkDL 7cdc8d3164 no message 3 years ago
BlinkDL 13bb641007 no message 3 years ago
BlinkDL f79137b524 supports megatron bin+idx format 3 years ago
BlinkDL 083f9504c6 + bf16 mode (more stable) 3 years ago
BlinkDL 6667ad18c2 more training tips 3 years ago
BlinkDL 165dfd1b9e RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel 3 years ago