From dd51d4b491e7444930f2565d49e51757cc8b8f3d Mon Sep 17 00:00:00 2001
From: randaller <randaller@users.noreply.github.com>
Date: Sat, 4 Mar 2023 13:43:42 +0300
Subject: [PATCH] Update README.md

---
 README.md | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 0e31191..9d622d5 100755
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-# LLaMA 
+# Run LLaMA model inference on CPU only
 
 This repository is intended as a minimal, hackable and readable example to load [LLaMA](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) ([arXiv](https://arxiv.org/abs/2302.13971v1)) models and run inference.
 In order to download the checkpoints and tokenizer, fill this [google form](https://forms.gle/jk851eBVbX1m5TAv5)
@@ -43,3 +43,8 @@ See [MODEL_CARD.md](MODEL_CARD.md)
 
 ### License
 See the [LICENSE](LICENSE) file.
+
+### CPU Inference
+1. Place `tokenizer.model` and `tokenizer_checklist.chk` in the `/tokenizer` folder.
+2. Place the three 7B model files in the `/model` folder.
+3. Run `python example-cpu.py`.