Commit History

Author SHA1 Message Date
  Alex Cheema 7a2fbf22b9 add model selection to tinychat 1 year ago
  Alex Cheema bbfd5adc20 add support for llama3.1 (8b, 70b, 405b). bump mlx up to 0.16.0 and mlx-lm up to 0.16.1. fixes #66 1 year ago
  Alex Cheema 5496cd85f5 Revert "smart model downloading for mlx #16" 1 year ago
  Alex Cheema 3a230f3b44 smart model downloading for mlx #16 1 year ago
  Alex Cheema 174cff071e Merge pull request #58 from jakobdylanc/main 1 year ago
  Alex Cheema b0e7dd9d2d add max-generate-tokens flag fixes #54 1 year ago
  JakobDylanC f2f61ccee6 inference engine selection improvements 1 year ago
  Alex Cheema 4e46232364 add simple prometheus metrics collection, with a prometheus / grafana instance for live dashboard. related: #22 1 year ago
  Alex Cheema 2e419ba211 Merge pull request #48 from itsknk/intel-mac 1 year ago
  itsknk e934664168 implement dynamic inference engine selection 1 year ago
  Alex Cheema 1fcbe18baa fix m2 ultra flops 1 year ago
  Alex Cheema 9d9d257eb2 reduce chatgpt api response timeout in test 1 year ago
  Alex Cheema 8850187b8a tell the mofo in the workflow to keep responses concise 1 year ago
  Alex Cheema 052ee1c7e9 cache isolation per workflow job 1 year ago
  Alex Cheema ce41e653c0 check cached files in workflow 1 year ago
  Alex Cheema 3d82338c21 debug cached files in workflow 1 year ago
  Alex Cheema aec58b3b36 remove redaudant discovery check in automated test 1 year ago
  Alex Cheema 9785e250c0 formatting if 1 year ago
  Alex Cheema 7708b47020 Merge pull request #49 from apotl/disable-viz-flag 1 year ago
  Alex Cheema 08b2f37532 test output spacing 1 year ago
  Alec Potluri db583a863f disable tui flag 1 year ago
  Alex Cheema 821f114bf9 add tests badge 1 year ago
  Alex Cheema 71b8c660be test workflow 1 year ago
  Alex Cheema 6c871562e4 fix huggingface cache 1 year ago
  Alex Cheema cf98cc50fa trigger workflow 1 year ago
  Alex Cheema 719e149aeb test trigger workflow 1 year ago
  Alex Cheema 9d939b3703 disable tinygrad test again, we need a smaller model or a machine with more memory otherwise we get Metal OOM 1 year ago
  Alex Cheema 774e620973 add space between outputs in github workflow integration test 1 year ago
  Alex Cheema a2a7ca1f8b cleaner node info = 1 year ago
  Alex Cheema 04f2aa2a65 try with METAL_XCODE=1 for tinygrad metal 1 year ago