Alex Cheema | c0534b67c3 | Merge commit: trigger test | 7 months ago
Alex Cheema | 063964aab3 | remove redundant sample_logits, put back opaque status for process_prompt so we have a way of preemptively starting downloads | 7 months ago
Alex Cheema | 804ad4705a | upgrade mlx | 7 months ago
Alex Cheema | c9ded9ba96 | optimise networking, remove bloat | 7 months ago
Alex Cheema | 64365d684f | one two and three m4 pro clusters | 7 months ago
Alex Cheema | 9397464fad | add commit to results | 7 months ago
Nel Nibcord | 08912d1b64 | Only collect topology if peers changed | 7 months ago
Alex Cheema | 06c2e236b8 | rip out stats bloat | 7 months ago
Alex Cheema | cb4615c95d | fix SendNewToken | 7 months ago
Alex Cheema | f55a53ae7e | one token at a time | 7 months ago
Gary | 25b4af70e0 | Merge branch 'main' into runners | 7 months ago
Alex Cheema | a93092105c | set max-generate-tokens to 250 | 7 months ago
Alex Cheema | 0c6ab35333 | increase timeout of http request in bench.py up to 10 mins | 7 months ago
Alex Cheema | 72be5e4bd5 | Merge pull request #556 from exo-explore/fixtestmodelhelpers | 7 months ago
Alex Cheema | b0e079b36a | fix counts in testmodelhelpers | 7 months ago
Alex Cheema | e5d54c77a9 | add llama-3.3-70b to 3 M4 Pro cluster | 7 months ago
Alex Cheema | 2ff4638122 | Merge remote-tracking branch 'origin/main' into runners | 7 months ago
Alex Cheema | 342b5d8ac0 | Merge pull request #555 from exo-explore/modelvariations | 7 months ago
Alex Cheema | a0bada3b2a | add llama-3.2-1b-8bit, llama-3.2-3b-8bit, llama-3.2-3b-bf16 | 7 months ago
Alex Cheema | b6f2385c41 | run llama-3.1-8b on 3 m4 pro cluster | 7 months ago
Alex Cheema | 9472ab0d2c | t | 7 months ago
Alex Cheema | dbb7ad3c08 | run with three m4 pro | 7 months ago
Alex Cheema | 2abe57be21 | grasping at straws | 7 months ago
Alex Cheema | eeecdcb409 | try a different taskpolicy | 7 months ago
Alex Cheema | f9f76129a1 | better bench system info | 7 months ago
Alex Cheema | 8c6d37d9b8 | m4 cluster test | 7 months ago
Alex Cheema | 2f74ea112e | Merge pull request #542 from wbic16/fix-issue-458 | 7 months ago
Alex Cheema | 1194db6e65 | m3 | 7 months ago
Alex Cheema | 8cb7327da2 | re-enable m4 cluster run | 7 months ago
Alex Cheema | bba0aa0877 | single node test 20 | 7 months ago