inference
|
b01f69bb6b
add support for multiple concurrent requests with request ids
|
1 рік тому |
networking
|
b01f69bb6b
add support for multiple concurrent requests with request ids
|
1 рік тому |
orchestration
|
b01f69bb6b
add support for multiple concurrent requests with request ids
|
1 рік тому |
topology
|
36b8456798
collect global topology with local peer visibility, ring memory weighted partitioning strategy
|
1 рік тому |
.gitignore
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
1 рік тому |
example_user.py
|
36b8456798
collect global topology with local peer visibility, ring memory weighted partitioning strategy
|
1 рік тому |
example_user_2.py
|
b01f69bb6b
add support for multiple concurrent requests with request ids
|
1 рік тому |
main.py
|
36b8456798
collect global topology with local peer visibility, ring memory weighted partitioning strategy
|
1 рік тому |
main_dynamic.py
|
7077652c8e
graceful node shutdown
|
1 рік тому |
requirements.txt
|
3a66a0a4a8
add requirements.txt
|
1 рік тому |