Update README.md

main
randaller 3 years ago committed by GitHub
parent 2e069ba154
commit 55b37c9133
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

@ -1,6 +1,6 @@
# Inference LLaMA models using CPU only
This repository is intended as a minimal, hackable and readable example to load [LLaMA](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) ([arXiv](https://arxiv.org/abs/2302.13971v1)) models and run inference by using only CPU. No videocard is needed.
This repository is intended as a minimal, hackable and readable example to load [LLaMA](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) ([arXiv](https://arxiv.org/abs/2302.13971v1)) models and run inference by using only CPU. No videocard is needed, but 64 (or better 128 Gb) of RAM is required.
### Setup
In a conda env with pytorch / cuda available, run

Loading…
Cancel
Save