| .. |
|
models
|
ca6095c04d
a generic test for every inference engine
|
há 1 ano atrás |
|
__init__.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
há 1 ano atrás |
|
sharded_inference_engine.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
há 1 ano atrás |
|
sharded_model.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
há 1 ano atrás |
|
sharded_utils.py
|
563dcb56b0
mlx sharded implementation with example of distributed inference
|
há 1 ano atrás |
|
test_sharded_llama.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
há 1 ano atrás |
|
test_sharded_model.py
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
há 1 ano atrás |