Cronologia Commit

Autore SHA1 Messaggio Data
  Alex Cheema 12609cb6e4 integration test for udp discovery with grpc server 11 mesi fa
  Alex Cheema f93f811dcb generalise UDPDiscovery to any kind of PeerHandle that accepts an address. test it 11 mesi fa
  Alex Cheema d4a932e405 fix merge 11 mesi fa
  Alex Cheema 2341aa1acf Merge branch 'main' into better_networking 11 mesi fa
  Alex Cheema 5a9f4ba5c1 update examples: remove old llama3_distributed, add chatgpt_api 11 mesi fa
  Alex Cheema 581856897a clean up unused, formatting 11 mesi fa
  Alex Cheema 62e3726263 add RTX 20 series to device capabilities 11 mesi fa
  Alex Cheema ebff636a25 script ot start openwebui 11 mesi fa
  Alex Cheema 394935711b add all chat endpoints without v1 prefix to support ollama / openwebui. related: #175 11 mesi fa
  Alex Cheema 70172d7cb9 add /v1/models endpoint and change Content-Type of stremed response to text/event-stream. fixes #175 11 mesi fa
  Alex Cheema d917778e2b update mlx to 0.17.1 (not sure where 0.17.0 went on PyPi disappeared)g 11 mesi fa
  Alex Cheema 2667c8af44 cleaner download_progress 11 mesi fa
  Alex Cheema f46d077beb fix font dependencies for tinychat. related: #172 1 anno fa
  Alex Cheema 8a4928f80c fix gitignore to not ignore tinychat static files 1 anno fa
  Alex Cheema d515d9efa3 explicitly use absolute paths for tinychat deps 1 anno fa
  Alex Cheema a386c35fde script to update tinychat deps 1 anno fa
  Alex Cheema 3791e669a4 download tinychat dependencies all to local dir so we dont need internet 1 anno fa
  Alex Cheema 85bab25ac0 fix local check if dir does not exist 1 anno fa
  Alex Cheema 59c4393d95 first try loading tokenizer from local path instead of always going to the internet first. significant speed ups 1 anno fa
  Alex Cheema 784e6bae21 print traceback on topology collection error 1 anno fa
  Alex Cheema 8cad0e1849 only use_fast tokenizer for Mistral Large until this inconsistency bug is fixed #171 1 anno fa
  Alex Cheema 85279007b3 hotfix edge case where we try to render before tokenizer is set 1 anno fa
  Alex Cheema 09a8468395 upgrade mlx to 0.17.0 1 anno fa
  Alex Cheema 1f9d16ec78 run tokenizers test in ci, run all models available 1 anno fa
  Alex Cheema 6243846eeb ci logs 1 anno fa
  Alex Cheema cfe980bdaa simplify ci 1 anno fa
  Alex Cheema 9513c4fd17 ci tail log files 1 anno fa
  Alex Cheema 7a02acdcd5 fix ci output streaming 1 anno fa
  Alex Cheema ad695696a5 run on every commit on main, reuqire approval on other branches 1 anno fa
  Alex Cheema 710e5a31e7 TODO for why use_fast=False is giving inconsistent behaviour (no spaces decoding invididual tokens) for Mistral-Large-Instruct-2407-4bit 1 anno fa