Alex Cheema
|
963f8eb6a1
better logs for DEBUG>=1
|
1 year ago |
Alex Cheema
|
a009f7d608
move examples to examples dir
|
1 year ago |
Alex Cheema
|
b6595bac04
add llama-3-70b to the examples
|
1 year ago |
Alex Cheema
|
54e8cad2d6
remove uneeded prints
|
1 year ago |
Alex Cheema
|
c691205591
empty space
|
1 year ago |
Alex Cheema
|
bcd58938de
clean debug logs
|
1 year ago |
Alex Cheema
|
b9c323bb07
memory-efficient shard loading
|
1 year ago |
Alex Cheema
|
53a5b3fc6a
add uuid requirement
|
1 year ago |
Alex Cheema
|
05b9fa497d
initialize node id to uuid4 if not set
|
1 year ago |
Alex Cheema
|
ff597d9551
fix discovery
|
1 year ago |
Alex Cheema
|
a04974168e
fix model import path
|
1 year ago |
Alex Cheema
|
b8a2a0fbe0
update readme run instruction
|
1 year ago |
Alex Cheema
|
a933352ac3
add DEBUG flag for controlling debug logs
|
1 year ago |
Alex Cheema
|
dd882fe6bc
experimental notice
|
1 year ago |
Alex Cheema
|
c8753ba5fe
reshuffle readme
|
1 year ago |
Alex Cheema
|
ee5204fbca
readme installation instructions
|
1 year ago |
Alex Cheema
|
78da11e10b
slightly nicer readme
|
1 year ago |
Alex Cheema
|
2fc472c8fe
slightly nicer readme
|
1 year ago |
Alex Cheema
|
8ff3e263a0
slightly nicer readme
|
1 year ago |
Alex Cheema
|
32f2e36fd3
main rename
|
1 year ago |
Alex Cheema
|
5bbde22a23
move everything under exo module
|
1 year ago |
Alex Cheema
|
c851644a43
update requirements, specify exact versions
|
1 year ago |
Alex Cheema
|
32972033dd
update readme
|
1 year ago |
Alex Cheema
|
5ef07d41a5
readme
|
1 year ago |
Alex Cheema
|
490fa102a4
tinygrad inference engine
|
1 year ago |
Alex Cheema
|
e6f387a690
handle is_finished
|
1 year ago |
Alex Cheema
|
b01f69bb6b
add support for multiple concurrent requests with request ids
|
1 year ago |
Alex Cheema
|
7077652c8e
graceful node shutdown
|
1 year ago |
Alex Cheema
|
ca6095c04d
a generic test for every inference engine
|
1 year ago |
Alex Cheema
|
850b72d3ea
make StatefulShardedModel callable, add some tests for mlx sharded inference
|
1 year ago