Commit History

Author SHA1 Message Date
  Alex Cheema 5a9f4ba5c1 update examples: remove old llama3_distributed, add chatgpt_api 1 year ago
  Alex Cheema 581856897a clean up unused, formatting 1 year ago
  Alex Cheema 62e3726263 add RTX 20 series to device capabilities 1 year ago
  Alex Cheema ebff636a25 script to start openwebui 1 year ago
  Alex Cheema 394935711b add all chat endpoints without v1 prefix to support ollama / openwebui. related: #175 1 year ago
  Alex Cheema 70172d7cb9 add /v1/models endpoint and change Content-Type of streamed response to text/event-stream. fixes #175 1 year ago
  Alex Cheema d917778e2b update mlx to 0.17.1 (not sure where 0.17.0 went on PyPi disappeared) 1 year ago
  Alex Cheema 2667c8af44 cleaner download_progress 1 year ago
  Alex Cheema f46d077beb fix font dependencies for tinychat. related: #172 1 year ago
  Alex Cheema 8a4928f80c fix gitignore to not ignore tinychat static files 1 year ago
  Alex Cheema d515d9efa3 explicitly use absolute paths for tinychat deps 1 year ago
  Alex Cheema a386c35fde script to update tinychat deps 1 year ago
  Alex Cheema 3791e669a4 download tinychat dependencies all to local dir so we dont need internet 1 year ago
  Alex Cheema 85bab25ac0 fix local check if dir does not exist 1 year ago
  Alex Cheema 59c4393d95 first try loading tokenizer from local path instead of always going to the internet first. significant speed ups 1 year ago
  Alex Cheema 784e6bae21 print traceback on topology collection error 1 year ago
  Alex Cheema 8cad0e1849 only use_fast tokenizer for Mistral Large until this inconsistency bug is fixed #171 1 year ago
  Alex Cheema 85279007b3 hotfix edge case where we try to render before tokenizer is set 1 year ago
  Alex Cheema 09a8468395 upgrade mlx to 0.17.0 1 year ago
  Alex Cheema 1f9d16ec78 run tokenizers test in ci, run all models available 1 year ago
  Alex Cheema 6243846eeb ci logs 1 year ago
  Alex Cheema cfe980bdaa simplify ci 1 year ago
  Alex Cheema 9513c4fd17 ci tail log files 1 year ago
  Alex Cheema 7a02acdcd5 fix ci output streaming 1 year ago
  Alex Cheema ad695696a5 run on every commit on main, require approval on other branches 1 year ago
  Alex Cheema 710e5a31e7 TODO for why use_fast=False is giving inconsistent behaviour (no spaces decoding individual tokens) for Mistral-Large-Instruct-2407-4bit 1 year ago
  Alex Cheema e17e5f9a41 tests for tokenizers. unfortunately use_fast=False and use_fast=True give different behaviour 1 year ago
  Alex Cheema 0d218e244e use fast AutoProcessor fixes #164 tokenizer issues with mistral-large. 1 year ago
  Alex Cheema 23ae5e92c5 hold circleci tests for approval on non-main branches 1 year ago
  Alex Cheema d54944f4ca stream outputs from chatgpt api integration test 1 year ago