Commit history

Author SHA1 Message Date
  Alex Cheema ee5204fbca readme installation instructions 1 year ago
  Alex Cheema 78da11e10b slightly nicer readme 1 year ago
  Alex Cheema 2fc472c8fe slightly nicer readme 1 year ago
  Alex Cheema 8ff3e263a0 slightly nicer readme 1 year ago
  Alex Cheema 32f2e36fd3 main rename 1 year ago
  Alex Cheema 5bbde22a23 move everything under exo module 1 year ago
  Alex Cheema c851644a43 update requirements, specify exact versions 1 year ago
  Alex Cheema 32972033dd update readme 1 year ago
  Alex Cheema 5ef07d41a5 readme 1 year ago
  Alex Cheema 490fa102a4 tinygrad inference engine 1 year ago
  Alex Cheema e6f387a690 handle is_finished 1 year ago
  Alex Cheema b01f69bb6b add support for multiple concurrent requests with request ids 1 year ago
  Alex Cheema 7077652c8e graceful node shutdown 1 year ago
  Alex Cheema ca6095c04d a generic test for every inference engine 1 year ago
  Alex Cheema 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference 1 year ago
  Alex Cheema 6ee0547eff fix layer calculation for sharded llama 1 year ago
  Alex Cheema 445eda156c dynamically assign shards to nodes deterministically weighted by memory 1 year ago
  Alex Cheema 36b8456798 collect global topology with local peer visibility, ring memory weighted partitioning strategy 1 year ago
  Alex Cheema 3a66a0a4a8 add requirements.txt 1 year ago
  Alex Cheema ee96c6b023 add another test for device capabiities on MacBook Air 1 year ago
  Alex Cheema 6c8c9ee7b1 topology with partitioning strategy 1 year ago
  Alex Cheema 563dcb56b0 mlx sharded implementation with example of distributed inference 1 year ago
  Alex Cheema a21f59ff45 scaffolding for networking, inference and orchestration 1 year ago