Update README.md

Commit `0e2612680f` (parent `1a7b31831b`) on `main`, by randaller, 3 years ago.
This repository is intended as a minimal, hackable and readable example to load [LLaMA](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) ([arXiv](https://arxiv.org/abs/2302.13971v1)) models and run inference using only the CPU. It therefore requires no video card, but 64 GB (preferably 128 GB) of RAM and a modern processor are required.
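A rough sketch of where that RAM requirement comes from: the raw weights dominate memory use. The back-of-envelope calculation below assumes nominal parameter counts (7B = 7e9, etc.) and ignores activations and other runtime overhead, so treat the numbers as lower bounds:

```python
# Back-of-envelope RAM estimate for holding LLaMA weights in memory.
# Parameter counts are nominal; actual checkpoints differ slightly.

def weight_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Approximate memory needed for the raw weights, in GiB."""
    return n_params * bytes_per_param / 2**30

for name, n in [("7B", 7e9), ("13B", 13e9), ("30B", 30e9), ("65B", 65e9)]:
    fp32 = weight_memory_gib(n, 4)  # 4 bytes per float32 parameter
    fp16 = weight_memory_gib(n, 2)  # 2 bytes per float16 parameter
    print(f"{name}: ~{fp32:.0f} GiB fp32, ~{fp16:.0f} GiB fp16")
```

By this estimate the 7B model alone needs roughly 26 GiB in fp32, which is why 64 GB of system RAM is a comfortable minimum once inference overhead is added.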
### Conda Environment Setup Example for Windows 10+
Download and install Anaconda Python from https://www.anaconda.com, then run Anaconda Prompt:
```
...
pip install -e .
```
### Download tokenizer and models
magnet:?xt=urn:btih:ZXXDAUWYLRUXXBHUYEMS6Q5CE5WA3LVA&dn=LLaMA
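As an aside, the `xt=urn:btih:` value in a magnet URI of this form is a base32-encoded 20-byte SHA-1 infohash. A minimal sketch (the helper name is made up for illustration) to convert it to the hex form some torrent clients expect:

```python
import base64

def btih_base32_to_hex(btih: str) -> str:
    """Decode a base32 BitTorrent infohash to lowercase hex."""
    raw = base64.b32decode(btih.upper())
    return raw.hex()

# Infohash taken from the magnet link above.
print(btih_base32_to_hex("ZXXDAUWYLRUXXBHUYEMS6Q5CE5WA3LVA"))
```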
### CPU Inference of 7B model
Place tokenizer.model and tokenizer_checklist.chk into repo's [/tokenizer] folder.
Place consolidated.00.pth and params.json from 7B torrent folder into repo's [/model] folder.
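Before launching inference, it can save a long load time to verify the file layout up front. A minimal sketch, assuming the folder names from the steps above (the helper itself is hypothetical, not part of the repo):

```python
from pathlib import Path

# Files the steps above place into the repo; adjust if your layout differs.
EXPECTED = [
    "tokenizer/tokenizer.model",
    "tokenizer/tokenizer_checklist.chk",
    "model/consolidated.00.pth",
    "model/params.json",
]

def missing_files(repo_root: str) -> list[str]:
    """Return the expected files that are absent under repo_root."""
    root = Path(repo_root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]

if __name__ == "__main__":
    missing = missing_files(".")
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All model and tokenizer files found.")
```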
Run it:
```
python example-cpu.py
```
### CPU Inference of 13B, 30B and 65B models
### Model Card
See [MODEL_CARD.md](MODEL_CARD.md)
