Alex Cheema ca6095c04d a generic test for every inference engine 1 year ago
..
models ca6095c04d a generic test for every inference engine 1 year ago
__init__.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago
sharded_inference_engine.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago
sharded_model.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago
sharded_utils.py 563dcb56b0 mlx sharded implementation with example of distributed inference 1 year ago
test_sharded_llama.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago
test_sharded_model.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago