Historique des commits

Auteur SHA1 Message Date
  Alex Cheema ca6095c04d a generic test for every inference engine il y a 1 an
  Alex Cheema 850b72d3ea make StatefulShardedModel callable, add some tests for mlx sharded inference il y a 1 an
  Alex Cheema 6ee0547eff fix layer calculation for sharded llama il y a 1 an
  Alex Cheema 445eda156c dynamically assign shards to nodes deterministically weighted by memory il y a 1 an
  Alex Cheema 36b8456798 collect global topology with local peer visibility, ring memory weighted partitioning strategy il y a 1 an
  Alex Cheema 3a66a0a4a8 add requirements.txt il y a 1 an
  Alex Cheema ee96c6b023 add another test for device capabiities on MacBook Air il y a 1 an
  Alex Cheema 6c8c9ee7b1 topology with partitioning strategy il y a 1 an
  Alex Cheema 563dcb56b0 mlx sharded implementation with example of distributed inference il y a 1 an
  Alex Cheema a21f59ff45 scaffolding for networking, inference and orchestration il y a 1 an