Commit History

Author SHA1 Message Date
  Alex Cheema 8cad0e1849 only use_fast tokenizer for Mistral Large until this inconsistency bug is fixed #171 11 months ago
  Alex Cheema 85279007b3 hotfix edge case where we try to render before tokenizer is set 11 months ago
  Alex Cheema 09a8468395 upgrade mlx to 0.17.0 11 months ago
  Alex Cheema 1f9d16ec78 run tokenizers test in ci, run all models available 11 months ago
  Alex Cheema 6243846eeb ci logs 11 months ago
  Alex Cheema cfe980bdaa simplify ci 11 months ago
  Alex Cheema 9513c4fd17 ci tail log files 11 months ago
  Alex Cheema 7a02acdcd5 fix ci output streaming 11 months ago
  Alex Cheema ad695696a5 run on every commit on main, reuqire approval on other branches 11 months ago
  Alex Cheema 710e5a31e7 TODO for why use_fast=False is giving inconsistent behaviour (no spaces decoding invididual tokens) for Mistral-Large-Instruct-2407-4bit 11 months ago
  Alex Cheema e17e5f9a41 tests for tokenizers. unfortunately use_fast=False and use_fast=True give different behaviour 11 months ago
  Alex Cheema 0d218e244e use fast AutoProcessor fixes #164 tokenizer issues with mistral-large. 11 months ago
  Alex Cheema 23ae5e92c5 hold circleci tests for approval on non-main branches 11 months ago
  Alex Cheema d54944f4ca stream outputs from chatgpt api integration test 11 months ago
  Alex Cheema 1133e27ad3 Merge pull request #166 from exo-explore/formatting 11 months ago
  Alex Cheema f53056dede more compact operator formatting 11 months ago
  Alex Cheema 14f2846a9c yapf set blank_line_before_nested_class_or_def to false 11 months ago
  Alex Cheema ea70c9fb76 reformat with yapf format.py 11 months ago
  Alex Cheema 2e27076665 simplify formatting with yapf 11 months ago
  Alex Cheema 417114fae4 fix mistral nemo 11 months ago
  Alex Cheema 5101f03369 keep 4 in RotatingKVCache 11 months ago
  Alex Cheema 6db73fab1c laptop gpu device capabilites 11 months ago
  Alex Cheema 647ffb94eb increase cli generation timeout 11 months ago
  Alex Cheema dd24e7db1e only ignore CancelledError inside stop 11 months ago
  Alex Cheema 2e1233357c ignore CancelledError when stopping the server 11 months ago
  Alex Cheema dfa3fdcf08 Merge pull request #161 from exo-explore/cli 11 months ago
  Alex Cheema ae35ada19b fix headless mode with --disable-tui 11 months ago
  Alex Cheema b95916e0b5 show prompts and outputs in tui 11 months ago
  Alex Cheema e84304317c add a cli that can be triggered with --run-model <model> --prompt <prompt> 11 months ago
  Alex Cheema cea9b48d24 update mlx-lm to 0.17.0, use lru caches for kv_cache with RotatingKVCache to optimise memory fixes #158 11 months ago