Update README.md

main
randaller 3 years ago committed by GitHub
parent afbb6b12b7
commit abe9ff94bb
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

@ -157,6 +157,8 @@ Confirming that 30B model is able to generate SQL code: https://github.com/randa
## Hugging Face 🤗 version (inference & training) ## Hugging Face 🤗 version (inference & training)
### Inference
Thanks to Yam Peleg, we now have *"No overengineering bullshit"* version. Thanks to Yam Peleg, we now have *"No overengineering bullshit"* version.
You do not need to download torrent or merge weights, as model shards and tokenizer will be downloaded from HF automatically at the first run. They will be cached in [C:\Users\USERNAME\\.cache\huggingface\hub] folder under Windows, so do not forget to clean up to 250 Gb after experiments. You do not need to download torrent or merge weights, as model shards and tokenizer will be downloaded from HF automatically at the first run. They will be cached in [C:\Users\USERNAME\\.cache\huggingface\hub] folder under Windows, so do not forget to clean up to 250 Gb after experiments.

Loading…
Cancel
Save