Nel Nibcord
|
c2332e2478
Moved nodes around
|
7 ay önce |
Nel Nibcord
|
763fbf8486
Updated node refs
|
7 ay önce |
Nel Nibcord
|
59af2dd592
Do we need casting here?
|
8 ay önce |
Nel Nibcord
|
b22c21ac16
Some session method cleanup
|
8 ay önce |
Nel Nibcord
|
98edb393b2
Initialize inference engine session in base class
|
8 ay önce |
Nel Nibcord
|
bcf87e79b7
Okay let's turn no_grad back on. We'll worry about that when tinygrad training works
|
8 ay önce |
Nel Nibcord
|
b7bbda3348
Removed tinygrad StatefulModel class, as it's no longer used
|
8 ay önce |
Nel Nibcord
|
67f5ae25a5
Fixing tinygrad model
|
8 ay önce |
Nel Nibcord
|
bfa3b36be5
Fixing tinygrad model
|
8 ay önce |
Nel Nibcord
|
37a75d6b96
Fixing tinygrad model
|
8 ay önce |
Nel Nibcord
|
0d3abfca95
Made models save properly
|
8 ay önce |
Nel Nibcord
|
9283f6d7bd
Correct loss propagation so we can see the actual loss instead of just the requestor shard's loss
|
8 ay önce |
Nel Nibcord
|
9eadee310b
Basic model saving
|
8 ay önce |
Nel Nibcord
|
38e368f00b
Fixed up the ops so that batches work
|
8 ay önce |
Nel Nibcord
|
dd3d99043b
Working distributed training
|
8 ay önce |
Nel Nibcord
|
175ebc1c42
Coordination biz
|
8 ay önce |
Nel Nibcord
|
3e869051f6
Okay we should probably await the update
|
8 ay önce |
Nel Nibcord
|
75c8650f1f
Naive network-propagated loss implementation on MLX
|
8 ay önce |
Nel Nibcord
|
836856824e
WIP: Training works on mlx
|
8 ay önce |
Nel Nibcord
|
a6fd7a3430
Generalizing some of the dataset biz while also creating uniform batches
|
8 ay önce |
Nel Nibcord
|
f5efbe1b8f
Initial distributed evaluation implementation
|
8 ay önce |
Alex Cheema
|
db9de97fa6
Merge pull request #549 from exo-explore/fixtokenencode
|
8 ay önce |
Alex Cheema
|
c593434808
fix encode endpoint
|
8 ay önce |
Alex Cheema
|
9f86737a94
fix token encode to use the right model
|
8 ay önce |
Alex Cheema
|
d411559a8b
Merge pull request #548 from exo-explore/subprocessforkfix
|
8 ay önce |
Alex Cheema
|
24130da4fd
prio mac check for interface
|
8 ay önce |
Alex Cheema
|
d6d74e9c0e
Merge pull request #547 from exo-explore/subprocessforkfix
|
8 ay önce |
Alex Cheema
|
31d7bc2df6
subprocess fork fix
|
8 ay önce |
Alex Cheema
|
022fabf7ff
Merge pull request #545 from exo-explore/topofixes2
|
8 ay önce |
Alex Cheema
|
56842a27f6
ignore topology merges from the non-owner
|
8 ay önce |