Once your request is approved, you will receive links to download the tokenizer and model files.
Edit the `download.sh` script with the signed url provided in the email to download the model weights and tokenizer.
### CPU Inference
Place tokenizer.model and tokenizer_checklist.chk into /tokenizer folder
Place three files of 7B model into /model folder
Run it:
```
python example-cpu.py
```
### Inference
The provided `example.py` can be run on a single or multi-gpu node with `torchrun` and will output completions for two pre-defined prompts. Using `TARGET_FOLDER` as defined in `download.sh`:
```
@ -43,10 +51,3 @@ See [MODEL_CARD.md](MODEL_CARD.md)
### License
See the [LICENSE](LICENSE) file.
### CPU Inference
Place tokenizer.model and tokenizer_checklist.chk into /tokenizer folder