5 Commits (4870e455b3653f7d7769fa5772b2c90ffad088df)

Author SHA1 Message Date
Georgi Gerganov 4870e455b3
Fix memory allocation issues and seg faults 3 years ago
Georgi Gerganov 483bab2e3d
Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)
Should make results reproducible for different number of threads and batch sizes
3 years ago
Yusuf Kağan Hanoğlu d5850c53ca
Add missing header for memcpy (#386)
fixed: memcpy is not defined
3 years ago
Georgi Gerganov 928480ef5b
Init llama_context_params properly from CLI (#370) 3 years ago
Georgi Gerganov f5a77a629b
Introduce C-style API (#370)
* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning
3 years ago