James Shield
|
a0024fd421
feat: support HF_ENDPOINT base url ENV VAR
|
10 months ago |
Alex Cheema
|
db9f44d16d
website link
|
10 months ago |
Alex Cheema
|
6c875dcc81
update hiring link
|
10 months ago |
Alex Cheema
|
074228e326
update README with hiring
|
10 months ago |
Alex Cheema
|
198cd6fb17
trigger ci
|
10 months ago |
Alex Cheema
|
20522e0638
update docs to make tinygrad usage clearer
|
10 months ago |
Alex Cheema
|
e0ed9170db
Merge pull request #209 from GaetanLepage/used-ports
|
10 months ago |
Gaetan Lepage
|
4b009401f9
move `.exo_used_ports` to `/tmp`
|
11 months ago |
Alex Cheema
|
0fa15367f7
Merge pull request #208 from exo-explore/broken_links_readme
|
11 months ago |
Alex Cheema
|
ca64456260
fix broken links in README
|
11 months ago |
Alex Cheema
|
87e08f89f1
Merge pull request #203 from exo-explore/non_blocking
|
11 months ago |
Alex Cheema
|
874886abc4
simplify mlx non blocking
|
11 months ago |
Alex Cheema
|
e616d4e86b
run realize on the result in tinygrad
|
11 months ago |
Alex Cheema
|
9345684b38
closely match prev impl mlx non blocking
|
11 months ago |
Alex Cheema
|
d6e661fd69
match previous impl with np.array in mlx
|
11 months ago |
Alex Cheema
|
caf9b57a2a
trigger ci
|
11 months ago |
Alex Cheema
|
84187113de
add a test for hf get_weight_map
|
11 months ago |
Alex Cheema
|
4ec613d4e8
simplify tinygrad non blocking
|
11 months ago |
Alex Cheema
|
6342384df4
Merge branch 'main' into non_blocking
|
11 months ago |
Alex Cheema
|
a1a0ffac55
add tinychat option for llama-3.1-70b-bf16
|
11 months ago |
Alex Cheema
|
de19f0ab42
Merge branch 'main' into non_blocking
|
11 months ago |
Alex Cheema
|
2948a83448
add llama-3.1-70b-bf16 model option
|
11 months ago |
Alex Cheema
|
11dd952d26
use set for shard specific patterns
|
11 months ago |
Alex Cheema
|
ea3322dea4
remove comment
|
11 months ago |
Alex Cheema
|
e0fda94d20
use sets for shard specific patterns
|
11 months ago |
Alex Cheema
|
b239c8a6d0
Merge branch 'main' into non_blocking
|
11 months ago |
Alex Cheema
|
8f65e1e697
fix weight_map resolution. previously we were always defaulting to allow pattern *.safetensors
|
11 months ago |
Alex Cheema
|
6881722b72
simplify non-blocking mlx inference
|
11 months ago |
Alex Cheema
|
9db16f8dca
use a queue for non-blocking mlx inference
|
11 months ago |
Alex Cheema
|
0ca5c26094
run mlx inference engine on a single thread too
|
11 months ago |