| .. |
|
mlx
|
ca6095c04d
a generic test for every inference engine
|
пре 1 година |
|
__init__.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
пре 1 година |
|
inference_engine.py
|
445eda156c
dynamically assign shards to nodes deterministically weighted by memory
|
пре 1 година |
|
shard.py
|
563dcb56b0
mlx sharded implementation with example of distributed inference
|
пре 1 година |
|
test_inference_engine.py
|
ca6095c04d
a generic test for every inference engine
|
пре 1 година |