Commit History

Author SHA1 Message Date
  James Shield a0024fd421 feat: support HF_ENDPOINT base url ENV VAR 10 months ago
  Alex Cheema db9f44d16d website link 10 months ago
  Alex Cheema 6c875dcc81 update hiring link 10 months ago
  Alex Cheema 074228e326 update README with hiring 10 months ago
  Alex Cheema 198cd6fb17 trigger ci 10 months ago
  Alex Cheema 20522e0638 update docs to make tinygrad usage clearer 10 months ago
  Alex Cheema e0ed9170db Merge pull request #209 from GaetanLepage/used-ports 10 months ago
  Gaetan Lepage 4b009401f9 move `.exo_used_ports` to `/tmp` 11 months ago
  Alex Cheema 0fa15367f7 Merge pull request #208 from exo-explore/broken_links_readme 11 months ago
  Alex Cheema ca64456260 fix broken links in README 11 months ago
  Alex Cheema 87e08f89f1 Merge pull request #203 from exo-explore/non_blocking 11 months ago
  Alex Cheema 874886abc4 simplify mlx non blocking 11 months ago
  Alex Cheema e616d4e86b run realize on the result in tinygrad 11 months ago
  Alex Cheema 9345684b38 closely match prev impl mlx non blocking 11 months ago
  Alex Cheema d6e661fd69 match previous impl with np.array in mlx 11 months ago
  Alex Cheema caf9b57a2a trigger ci 11 months ago
  Alex Cheema 84187113de add a test for hf get_weight_map 11 months ago
  Alex Cheema 4ec613d4e8 simplify tinygrad non blocking 11 months ago
  Alex Cheema 6342384df4 Merge branch 'main' into non_blocking 11 months ago
  Alex Cheema a1a0ffac55 add tinychat option for llama-3.1-70b-bf16 11 months ago
  Alex Cheema de19f0ab42 Merge branch 'main' into non_blocking 11 months ago
  Alex Cheema 2948a83448 add llama-3.1-70b-bf16 model option 11 months ago
  Alex Cheema 11dd952d26 use set for shard specific patterns 11 months ago
  Alex Cheema ea3322dea4 remove comment 11 months ago
  Alex Cheema e0fda94d20 use sets for shard specific patterns 11 months ago
  Alex Cheema b239c8a6d0 Merge branch 'main' into non_blocking 11 months ago
  Alex Cheema 8f65e1e697 fix weight_map resolution. previously we were always defaulting to allow pattern *.safetensors 11 months ago
  Alex Cheema 6881722b72 simplify non-blocking mlx inference 11 months ago
  Alex Cheema 9db16f8dca use a queue for non-blocking mlx inference 11 months ago
  Alex Cheema 0ca5c26094 run mlx inference engine on a single thread too 11 months ago