Alex Cheema | 7a2fbf22b9 | add model selection to tinychat | 1 year ago
Alex Cheema | bbfd5adc20 | add support for llama3.1 (8b, 70b, 405b). bump mlx up to 0.16.0 and mlx-lm up to 0.16.1. fixes #66 | 1 year ago
Alex Cheema | 5496cd85f5 | Revert "smart model downloading for mlx #16" | 1 year ago
Alex Cheema | 3a230f3b44 | smart model downloading for mlx #16 | 1 year ago
Alex Cheema | 174cff071e | Merge pull request #58 from jakobdylanc/main | 1 year ago
Alex Cheema | b0e7dd9d2d | add max-generate-tokens flag fixes #54 | 1 year ago
JakobDylanC | f2f61ccee6 | inference engine selection improvements | 1 year ago
Alex Cheema | 4e46232364 | add simple prometheus metrics collection, with a prometheus / grafana instance for live dashboard. related: #22 | 1 year ago
Alex Cheema | 2e419ba211 | Merge pull request #48 from itsknk/intel-mac | 1 year ago
itsknk | e934664168 | implement dynamic inference engine selection | 1 year ago
Alex Cheema | 1fcbe18baa | fix m2 ultra flops | 1 year ago
Alex Cheema | 9d9d257eb2 | reduce chatgpt api response timeout in test | 1 year ago
Alex Cheema | 8850187b8a | tell the mofo in the workflow to keep responses concise | 1 year ago
Alex Cheema | 052ee1c7e9 | cache isolation per workflow job | 1 year ago
Alex Cheema | ce41e653c0 | check cached files in workflow | 1 year ago
Alex Cheema | 3d82338c21 | debug cached files in workflow | 1 year ago
Alex Cheema | aec58b3b36 | remove redundant discovery check in automated test | 1 year ago
Alex Cheema | 9785e250c0 | formatting if | 1 year ago
Alex Cheema | 7708b47020 | Merge pull request #49 from apotl/disable-viz-flag | 1 year ago
Alex Cheema | 08b2f37532 | test output spacing | 1 year ago
Alec Potluri | db583a863f | disable tui flag | 1 year ago
Alex Cheema | 821f114bf9 | add tests badge | 1 year ago
Alex Cheema | 71b8c660be | test workflow | 1 year ago
Alex Cheema | 6c871562e4 | fix huggingface cache | 1 year ago
Alex Cheema | cf98cc50fa | trigger workflow | 1 year ago
Alex Cheema | 719e149aeb | test trigger workflow | 1 year ago
Alex Cheema | 9d939b3703 | disable tinygrad test again, we need a smaller model or a machine with more memory otherwise we get Metal OOM | 1 year ago
Alex Cheema | 774e620973 | add space between outputs in github workflow integration test | 1 year ago
Alex Cheema | a2a7ca1f8b | cleaner node info = | 1 year ago
Alex Cheema | 04f2aa2a65 | try with METAL_XCODE=1 for tinygrad metal | 1 year ago