This repository is intended as a minimal, hackable and readable example to load [LLaMA](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) ([arXiv](https://arxiv.org/abs/2302.13971v1)) models and run inference.
To download the checkpoints and tokenizer, fill out this [Google form](https://forms.gle/jk851eBVbX1m5TAv5).