Commit History

Author SHA1 Message Date
  Alex Cheema 581856897a clean up unused, formatting 11 months ago
  Alex Cheema 62e3726263 add RTX 20 series to device capabilities 11 months ago
  Alex Cheema ebff636a25 script to start openwebui 11 months ago
  Alex Cheema 394935711b add all chat endpoints without v1 prefix to support ollama / openwebui. related: #175 11 months ago
  Alex Cheema 70172d7cb9 add /v1/models endpoint and change Content-Type of streamed response to text/event-stream. fixes #175 11 months ago
  Alex Cheema d917778e2b update mlx to 0.17.1 (not sure where 0.17.0 went, it disappeared from PyPI) 11 months ago
  Alex Cheema 2667c8af44 cleaner download_progress 11 months ago
  Alex Cheema f46d077beb fix font dependencies for tinychat. related: #172 11 months ago
  Alex Cheema 8a4928f80c fix gitignore to not ignore tinychat static files 11 months ago
  Alex Cheema d515d9efa3 explicitly use absolute paths for tinychat deps 11 months ago
  Alex Cheema a386c35fde script to update tinychat deps 11 months ago
  Alex Cheema 3791e669a4 download tinychat dependencies all to local dir so we dont need internet 11 months ago
  Alex Cheema 85bab25ac0 fix local check if dir does not exist 11 months ago
  Alex Cheema 59c4393d95 first try loading tokenizer from local path instead of always going to the internet first. significant speed ups 11 months ago
  Alex Cheema 784e6bae21 print traceback on topology collection error 11 months ago
  Alex Cheema 8cad0e1849 only use_fast tokenizer for Mistral Large until this inconsistency bug is fixed #171 11 months ago
  Alex Cheema 85279007b3 hotfix edge case where we try to render before tokenizer is set 11 months ago
  Alex Cheema 09a8468395 upgrade mlx to 0.17.0 11 months ago
  Alex Cheema 1f9d16ec78 run tokenizers test in ci, run all models available 11 months ago
  Alex Cheema 6243846eeb ci logs 11 months ago
  Alex Cheema cfe980bdaa simplify ci 11 months ago
  Alex Cheema 9513c4fd17 ci tail log files 11 months ago
  Alex Cheema 7a02acdcd5 fix ci output streaming 11 months ago
  Alex Cheema ad695696a5 run on every commit on main, require approval on other branches 11 months ago
  Alex Cheema 710e5a31e7 TODO for why use_fast=False is giving inconsistent behaviour (no spaces when decoding individual tokens) for Mistral-Large-Instruct-2407-4bit 11 months ago
  Alex Cheema e17e5f9a41 tests for tokenizers. unfortunately use_fast=False and use_fast=True give different behaviour 11 months ago
  Alex Cheema 0d218e244e use fast AutoProcessor fixes #164 tokenizer issues with mistral-large. 11 months ago
  Alex Cheema 23ae5e92c5 hold circleci tests for approval on non-main branches 11 months ago
  Alex Cheema d54944f4ca stream outputs from chatgpt api integration test 11 months ago
  Alex Cheema 1133e27ad3 Merge pull request #166 from exo-explore/formatting 11 months ago