Commit History

Author SHA1 Message Date
  Nel Nibcord 8f0d19e9b0 embed fix 8 months ago
  Nel Nibcord ee97563b45 Dummied up an abstact save_checkpoint 8 months ago
  Nel Nibcord 02281ebe3d Missed one 8 months ago
  Nel Nibcord dffa17b2d1 Slightly simplified waiting for outstanding requests 8 months ago
  Nel Nibcord 720940d563 Removed statefulModel stuff from mlx impl too 8 months ago
  Nel Nibcord bc2812238f Removed ensure_session to clean stuff up. May revisit later 8 months ago
  Nel Nibcord e9971f74ae Abstract load checkpoint method 8 months ago
  Nel Nibcord 223d35cea0 circular include lol 8 months ago
  Nel Nibcord 8dc73074e6 Nodes don't need an abstract base class 8 months ago
  Nel Nibcord 59af2dd592 Do we need casting here? 8 months ago
  Nel Nibcord b22c21ac16 Some session method cleanup 8 months ago
  Nel Nibcord 98edb393b2 Initialize inference engine session in base class 8 months ago
  Nel Nibcord bcf87e79b7 Okay let's turn no_grad back on. We'll worry about that when tinygrad training works 8 months ago
  Nel Nibcord b7bbda3348 Removed tinygrad StatefulModel class, as it's no longer used 8 months ago
  Nel Nibcord 67f5ae25a5 Fixing tinygrad model 8 months ago
  Nel Nibcord bfa3b36be5 Fixing tinygrad model 8 months ago
  Nel Nibcord 37a75d6b96 Fixing tinygrad model 8 months ago
  Nel Nibcord 0d3abfca95 Made models save properly 8 months ago
  Nel Nibcord 9283f6d7bd Correct loss propagation so we can see the actual loss instead of just the requestor shard's loss 8 months ago
  Nel Nibcord 9eadee310b Basic model saving 8 months ago
  Nel Nibcord 38e368f00b Fixed up the ops so that batches work 8 months ago
  Nel Nibcord dd3d99043b Working distributed training 8 months ago
  Nel Nibcord 175ebc1c42 Coordination biz 8 months ago
  Nel Nibcord 3e869051f6 Okay we should probably await the update 8 months ago
  Nel Nibcord 75c8650f1f Naive network-propagated loss implementation on MLX 8 months ago
  Nel Nibcord 836856824e WIP: Training works on mlx 8 months ago
  Nel Nibcord a6fd7a3430 Generalizing some of the dataset biz while also creating uniform batches 8 months ago
  Nel Nibcord f5efbe1b8f Initial distributed evaluation implementation 8 months ago
  Alex Cheema db9de97fa6 Merge pull request #549 from exo-explore/fixtokenencode 8 months ago
  Alex Cheema c593434808 fix encode endpoint 8 months ago