Alex Cheema | b8a2a0fbe0 | update readme run instruction | 1 year ago
Alex Cheema | a933352ac3 | add DEBUG flag for controlling debug logs | 1 year ago
Alex Cheema | dd882fe6bc | experimental notice | 1 year ago
Alex Cheema | c8753ba5fe | reshuffle readme | 1 year ago
Alex Cheema | ee5204fbca | readme installation instructions | 1 year ago
Alex Cheema | 78da11e10b | slightly nicer readme | 1 year ago
Alex Cheema | 2fc472c8fe | slightly nicer readme | 1 year ago
Alex Cheema | 8ff3e263a0 | slightly nicer readme | 1 year ago
Alex Cheema | 32f2e36fd3 | main rename | 1 year ago
Alex Cheema | 5bbde22a23 | move everything under exo module | 1 year ago
Alex Cheema | c851644a43 | update requirements, specify exact versions | 1 year ago
Alex Cheema | 32972033dd | update readme | 1 year ago
Alex Cheema | 5ef07d41a5 | readme | 1 year ago
Alex Cheema | 490fa102a4 | tinygrad inference engine | 1 year ago
Alex Cheema | e6f387a690 | handle is_finished | 1 year ago
Alex Cheema | b01f69bb6b | add support for multiple concurrent requests with request ids | 1 year ago
Alex Cheema | 7077652c8e | graceful node shutdown | 1 year ago
Alex Cheema | ca6095c04d | a generic test for every inference engine | 1 year ago
Alex Cheema | 850b72d3ea | make StatefulShardedModel callable, add some tests for mlx sharded inference | 1 year ago
Alex Cheema | 6ee0547eff | fix layer calculation for sharded llama | 1 year ago
Alex Cheema | 445eda156c | dynamically assign shards to nodes deterministically weighted by memory | 1 year ago
Alex Cheema | 36b8456798 | collect global topology with local peer visibility, ring memory weighted partitioning strategy | 1 year ago
Alex Cheema | 3a66a0a4a8 | add requirements.txt | 1 year ago
Alex Cheema | ee96c6b023 | add another test for device capabilities on MacBook Air | 1 year ago
Alex Cheema | 6c8c9ee7b1 | topology with partitioning strategy | 1 year ago
Alex Cheema | 563dcb56b0 | mlx sharded implementation with example of distributed inference | 1 year ago
Alex Cheema | a21f59ff45 | scaffolding for networking, inference and orchestration | 1 year ago