Update README.md

3 years ago · a5f9751a2e
parent e9ffe171e7
commit a5f9751a2e
1 changed files with 1 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -25,7 +25,7 @@ One may run with 32 Gb of RAM, but inference will be slow (with the speed of you

 I am running PyArrow version on a [12700k/128 Gb RAM/NVIDIA 3070ti 8Gb/fast huge nvme with 256 Gb swap for 65B model] and getting one token from 30B model in a few seconds.

-For example, **30B model uses around 70 Gb of RAM**. 7B model fits into 18 Gb. 13B model uses 48 Gb.
+For example, **PyArrow 30B model uses around 70 Gb of RAM**. 7B model fits into 18 Gb. 13B model uses 48 Gb.

 If you do not have nvidia videocard, you may use another repo for cpu-only inference: https://github.com/randaller/llama-cpu or [HF 🤗 version](https://github.com/randaller/llama-chat#hugging-face--version-inference--training).