|
|
|
|
@ -183,7 +183,7 @@ python hf-inference-example.py
|
|
|
|
|
|
|
|
|
|
### Bfloat16 training and inference optimization
|
|
|
|
|
|
|
|
|
|
To save memory you may enable Bfloat16 processing.
|
|
|
|
|
To save CPU RAM or GPU VRAM memory, one may wish to enable Bfloat16 processing.
|
|
|
|
|
|
|
|
|
|
```
|
|
|
|
|
# to save memory use bfloat16 on cpu
|
|
|
|
|
|