BlinkDL
|
6ff859db80
|
no message
|
3 years ago |
BlinkDL
|
8cced0383e
|
no message
|
3 years ago |
BlinkDL
|
c84e8fd952
|
bugfix
|
3 years ago |
BlinkDL
|
73b96705d7
|
+ fp32 mode (slow but good for verification)
|
3 years ago |
BlinkDL
|
a1bf15ac40
|
no message
|
3 years ago |
BlinkDL
|
6299c087a4
|
fixed VRAM consumpition
|
3 years ago |
BlinkDL
|
c1f7a72724
|
saves some VRAM for 1 GPU training
|
3 years ago |
BlinkDL
|
68c486ad10
|
supports RWKV-4 pile models
|
3 years ago |
BlinkDL
|
7cdc8d3164
|
no message
|
3 years ago |
BlinkDL
|
f79137b524
|
supports megatron bin+idx format
|
3 years ago |
BlinkDL
|
083f9504c6
|
+ bf16 mode (more stable)
|
3 years ago |
BlinkDL
|
6667ad18c2
|
more training tips
|
3 years ago |
BlinkDL
|
165dfd1b9e
|
RWKV-4 with DeepSpeed & FP16 & Better CUDA Kernel
|
3 years ago |