Alex Cheema 490fa102a4 tinygrad inference engine 1 سال پیش
..
mlx ca6095c04d a generic test for every inference engine 1 سال پیش
tinygrad 490fa102a4 tinygrad inference engine 1 سال پیش
__init__.py 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 سال پیش
inference_engine.py b01f69bb6b add support for multiple concurrent requests with request ids 1 سال پیش
shard.py 563dcb56b0 mlx sharded implementation with example of distributed inference 1 سال پیش
test_inference_engine.py 490fa102a4 tinygrad inference engine 1 سال پیش