Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Alex Cheema | a342e1abd8 | add web url and chatgpt api endpoint to panel (fixes #43), fix a rounding error in the partition to shard mapping implementation | 1 year ago |
| Alex Cheema | d9484906a3 | remove the spammy logs | 1 year ago |
| Alex Cheema | 4b592f9d45 | exo topology visualisation that shows the topology of the network, device capabilities and the currently active node using opaque statuses. fixes #36. ready for #33 | 1 year ago |
| Alex Cheema | 35177690bd | by default find an ephemeral node port fixes #35, more robust topology updates. both fix #15 and #14 | 1 year ago |
| Alex Cheema | 945f90f676 | allow overriding inference_engine and separate flag for TINYGRAD_DEBUG | 1 year ago |
| Alex Cheema | 72fe293729 | exo text on start and stop | 1 year ago |
| Alex Cheema | 1e1e11cdc6 | check if inference_engine has tokenizer before printing with it | 1 year ago |
| Alex Cheema | eb92da2c3e | cleaner chatgpt api impl with async callbacks | 1 year ago |
| Alex Cheema | 71e00745cc | fix tokenizer inconsistencies | 1 year ago |
| Alex Cheema | ce46f00059 | linux device capabilities | 1 year ago |
| Alex Cheema | dbbc7be57f | remove hard dependency on MLX fixes #8 | 1 year ago |
| Alex Cheema | dd8d18128c | add an opaque inference_state that inference engines can use to pass around small state to other devices | 1 year ago |
| Alex Cheema | f2895cbcee | revive the chatgpt api endpoint on :8000 | 1 year ago |
| Alex Cheema | 05b9fa497d | initialize node id to uuid4 if not set | 1 year ago |
| Alex Cheema | 32f2e36fd3 | main rename | 1 year ago |
| Alex Cheema | 5bbde22a23 | move everything under exo module | 1 year ago |
| Alex Cheema | 36b8456798 | collect global topology with local peer visibility, ring memory weighted partitioning strategy | 1 year ago |
| Alex Cheema | 563dcb56b0 | mlx sharded implementation with example of distributed inference | 1 year ago |
| Alex Cheema | a21f59ff45 | scaffolding for networking, inference and orchestration | 1 year ago |