diff --git a/README.md b/README.md index 70e9c02..5ed18c2 100755 --- a/README.md +++ b/README.md @@ -17,6 +17,14 @@ pip install -e . Once your request is approved, you will receive links to download the tokenizer and model files. Edit the `download.sh` script with the signed url provided in the email to download the model weights and tokenizer. +### CPU Inference +Place tokenizer.model and tokenizer_checklist.chk into /tokenizer folder +Place three files of 7B model into /model folder +Run it: +``` +python example-cpu.py +``` + ### Inference The provided `example.py` can be run on a single or multi-gpu node with `torchrun` and will output completions for two pre-defined prompts. Using `TARGET_FOLDER` as defined in `download.sh`: ``` @@ -43,10 +51,3 @@ See [MODEL_CARD.md](MODEL_CARD.md) ### License See the [LICENSE](LICENSE) file. - -### CPU Inference -Place tokenizer.model and tokenizer_checklist.chk into /tokenizer folder - -Place three files of 7B model into /model folder - -Run python example-cpu.py