Commit Graph

  • c494ed5b94
    Fix off-by-one bug (#115) master-c494ed5 Georgi Gerganov 2023-03-19 19:46:32 +0200
  • c1c7026b47
    Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +0200
  • 467b149761
    Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109) qunash 2023-03-19 20:17:39 +0300
  • 70f01cb863
    Drop trailing new line from file prompts (#80) master-70f01cb Georgi Gerganov 2023-03-19 19:04:44 +0200
  • a4e63b73df
    Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +0200
  • 9e1707218a
    Add "--instruct" argument for usage with Alpaca (#240) master-9e17072 Georgi Gerganov 2023-03-19 18:37:02 +0200
  • 22213a17b5
    Change RMSNorm eps to 1e-6 (#173) master-22213a1 Georgi Gerganov 2023-03-19 17:30:00 +0200
  • d7def1a752
    Warn user if a context size greater than 2048 tokens is specified (#274) master-d7def1a Ronsor 2023-03-18 17:10:47 -0700
  • 6f61c18ec9 Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +0100
  • 1e5a6d088d Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +0100
  • 554b541521 Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +0100
  • d3f202d57b
    Remove unused code since n_vocab is model.hparams.n_vocab (#262) master-d3f202d Alex Nguyen 2023-03-18 20:51:49 +0700
  • e03e359730
    fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -0400
  • a81d0c2a17
    Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -0700
  • b2de7f18df
    CI Improvements (#230) anzz1 2023-03-18 09:27:12 +0200
  • 0b5448a3a4
    Implement system polyfill for win32 / posix.1 mmap Justine Tunney 2023-03-17 21:22:40 -0700
  • a292747893
    Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +0100
  • c9f670a177
    Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +0100
  • 4f54609110
    Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +0200
  • e81b9c81c1
    Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +0200
  • 367946c668
    Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +0000
  • 6b0df5ccf3
    add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +0800
  • 2af23d3043
    🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +0100
  • 5b8023d935
    Implement prototype for instant mmap() loading Justine Tunney 2023-03-16 22:03:09 -0700
  • 904d2a8d6a
    Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +0100
  • 721311070e
    Update README.md Georgi Gerganov 2023-03-16 15:00:09 +0200
  • ac15de7895
    Expand "Contributing" section Georgi Gerganov 2023-03-16 08:55:13 +0200
  • 273abc47ff
    Update hot topics - RMSnorm Georgi Gerganov 2023-03-16 07:12:12 +0200
  • 9b4a15b17d
    Fix RMS norm in GGML (#191) Nebula 2023-03-15 19:29:25 -0400
  • 6eac39ba95
    Add RMS norm and use it (#187) hoangmit 2023-03-15 18:41:38 -0400
  • 27944c4206
    fixed typo (#178) moritzbrantner 2023-03-15 21:35:25 +0100
  • 2d15d6c9a9
    add SIGINT support for _WIN32 environments (#120) Rickey Bowers Jr 2023-03-15 13:56:24 -0600
  • 2d64715ad4
    added ctx_size parameter (#148) Justin Suess 2023-03-15 15:42:40 -0400
  • 16b2c61a22
    fixed color reset on exit (#149) Justin Suess 2023-03-15 15:39:38 -0400
  • 977295c700
    Fix potential licensing issue (#126) Musab Gultekin 2023-03-15 22:39:06 +0300
  • 956dfda8ad
    Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-ggml.py (#142) Ronsor 2023-03-15 12:37:50 -0700
  • 113e685d18
    inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -0400
  • 2788f373be
    Get the build working Justine Tunney 2023-03-15 02:52:31 -0700
  • 47857e564c
    Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -0700
  • 60f819a2b1
    Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +0200
  • 97ab2b2578
    Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +0200
  • 2f700a2738
    Add windows to the CI (#98) Sebastián A 2023-03-13 17:29:10 -0300
  • c09a9cfb06
    CMake build in Release by default (#75) Georgi Gerganov 2023-03-13 21:22:15 +0200
  • 7ec903d3c1
    Update contribution section, hot topics, limitations, etc. Georgi Gerganov 2023-03-13 19:21:51 +0200
  • 4497ad819c
    Print system information Georgi Gerganov 2023-03-13 19:15:08 +0200
  • ed6849cc07
    Initial support for CMake (#75) Sebastián A 2023-03-13 14:12:33 -0300
  • 41be0a3b3d
    Add NetBSD support. (#90) Thomas Klausner 2023-03-13 17:40:54 +0100
  • 671d5cac15
    Use fprintf for diagnostic output (#48) Pavol Rusnak 2023-03-13 17:39:56 +0100
  • 84d9015c4a
    Use vdotq_s32 to improve performance (#67) Georgi Gerganov 2023-03-13 18:36:44 +0200
  • 63fd76fbb0
    Reduce model loading time (#43) uint256_t 2023-03-14 01:33:43 +0900
  • 2a20f48efa
    Fix UTF-8 handling (including colors) (#79) Val Kharitonov 2023-03-13 12:24:18 -0400
  • d1f224712d
    Add quantize script for batch quantization (#92) Pavol Rusnak 2023-03-13 17:15:20 +0100
  • 1808ee0500
    Add initial contribution guidelines Georgi Gerganov 2023-03-13 09:42:26 +0200
  • a169bb889c Gate signal support on being on a unixoid system. (#74) dev Matvey Soloviev 2023-03-13 04:08:01 +0100
  • 460c482540 Fix token count accounting Matvey Soloviev 2023-03-13 00:35:51 +0100
  • c80e2a8f2a
    Revert "10% performance boost on ARM" Georgi Gerganov 2023-03-13 01:28:08 +0200
  • 54a0e66ea0
    Check for vdotq_s32 availability Georgi Gerganov 2023-03-13 01:21:03 +0200
  • 543c57e991
    Ammend to previous commit - forgot to update non-QRDMX branch Georgi Gerganov 2023-03-13 01:05:24 +0200
  • 113a9e83eb
    10% performance boost on ARM Georgi Gerganov 2023-03-13 00:56:10 +0200
  • 404fac0d62
    Fix color getting reset before prompt output done (#65) Matvey Soloviev 2023-03-12 23:07:34 +0100
  • 1a0a74300f
    Update README.md Georgi Gerganov 2023-03-12 23:39:01 +0200
  • 96ea727f47
    Add interactive mode (#61) Matvey Soloviev 2023-03-12 22:13:28 +0100
  • 9661954835
    Fix typo in README (#45) Marc Köhlbrugge 2023-03-13 03:30:08 +0700
  • f385f8dee8
    Allow using prompt files (#59) Ben Garney 2023-03-12 13:28:36 -0700
  • 02f0c6fe7f
    Add back top_k (#56) beiller 2023-03-12 16:23:15 -0400
  • eb062bb012
    Windows fixes (#31) Sebastián A 2023-03-12 17:15:00 -0300
  • 7027a97837
    Update README.md Georgi Gerganov 2023-03-12 22:09:26 +0200
  • 2d555e5b42
    Add CI (#60) Georgi Gerganov 2023-03-12 22:08:24 +0200
  • 7c9e54e55e
    Revert "weights_only" arg - this causing more trouble than help Georgi Gerganov 2023-03-12 20:59:01 +0200
  • b9bd1d0141
    python/pytorch compat notes (#44) Oleksandr Nikitin 2023-03-12 14:16:33 +0200
  • 129c7d1ea8
    Add repetition penalty (#20) beiller 2023-03-12 05:27:42 -0400
  • 702fddf5c5
    Clarify meaning of hacking Georgi Gerganov 2023-03-12 09:03:25 +0200
  • 7d86e25bf6
    README: add "Supported platforms" + update hot topics Georgi Gerganov 2023-03-12 08:41:54 +0200
  • a93120236f
    use weights_only in conversion script (#32) deepdiffuser 2023-03-11 22:36:35 -0800
  • 6a9a67f0be
    Add LICENSE (#21) Pavol Rusnak 2023-03-12 07:36:03 +0100
  • da1a4ff01f
    Update README.md Georgi Gerganov 2023-03-12 01:26:32 +0200
  • 6b2cb6302f
    Fix a typo in model name (#16) Juraj Bednar 2023-03-11 18:32:20 +0100
  • 4235e3d5b3
    Update README.md Georgi Gerganov 2023-03-11 18:10:18 +0200
  • f1eaff4721 Add AVX2 support for x86 architectures thanks to @Const-me ! Georgi Gerganov 2023-03-11 17:58:18 +0200
  • a9e58529ea Fix un-initialized FP16 tables on x86 (#15, #2) Georgi Gerganov 2023-03-11 17:40:14 +0200
  • 7d9ed7b25f
    Bump memory buffer Georgi Gerganov 2023-03-11 12:44:21 +0200
  • 0c6803321c
    Update README.md Georgi Gerganov 2023-03-11 12:31:21 +0200
  • f60fa9e50a
    .gitignore models/ Georgi Gerganov 2023-03-11 12:26:46 +0200
  • 7211862c94
    Update Makefile var + add comment Georgi Gerganov 2023-03-11 12:26:16 +0200
  • a5c5ae2f54
    Update README.md Georgi Gerganov 2023-03-11 11:34:25 +0200
  • ea977e85ec
    Update README.md Georgi Gerganov 2023-03-11 11:34:11 +0200
  • 007a8f6f45
    Support all LLaMA models + change Q4_0 quantization storage Georgi Gerganov 2023-03-11 10:47:09 +0200
  • 5f2f970d51
    Include Python dependencies in README (#6) Simon Willison 2023-03-10 21:47:26 -0800
  • 73c6ed5e87
    Update README.md Georgi Gerganov 2023-03-11 01:30:47 +0200
  • 01eeed8fb1
    Update README.md Georgi Gerganov 2023-03-11 01:22:58 +0200
  • 6da2df34ee
    Update README.md Georgi Gerganov 2023-03-11 01:18:10 +0200
  • 9dcf4dba45
    Add missing headers for memcpy and assert (#3) Jean-Michaël Celerier 2023-03-10 18:04:06 -0500
  • 920a7fe2d9
    Update README.md Georgi Gerganov 2023-03-11 00:55:22 +0200
  • 3a57ee59de
    Update README.md Georgi Gerganov 2023-03-11 00:51:46 +0200
  • b85028522d
    Update README.md Georgi Gerganov 2023-03-11 00:09:19 +0200
  • 8a01f565ff
    Update README.md Georgi Gerganov 2023-03-10 23:53:11 +0200
  • 70bc0b8b15
    Fix a bug in the rope calculation Georgi Gerganov 2023-03-10 23:46:39 +0200
  • 18ebda34d6
    Update README.md Georgi Gerganov 2023-03-10 21:52:27 +0200
  • 319cdb3e1f
    Final touches Georgi Gerganov 2023-03-10 21:50:46 +0200
  • 775328064e
    Create README.md Georgi Gerganov 2023-03-10 21:47:46 +0200