Alex Cheema ca6095c04d a generic test for every inference engine пре 1 година
..
mlx ca6095c04d a generic test for every inference engine пре 1 година
__init__.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference пре 1 година
inference_engine.py 445eda156c dynamically assign shards to nodes deterministically weighted by memory пре 1 година
shard.py 563dcb56b0 mlx sharded implementation with example of distributed inference пре 1 година
test_inference_engine.py ca6095c04d a generic test for every inference engine пре 1 година