Commit History

Author SHA1 Message Date
  Nel Nibcord 90518a3bbe Hoisted caching to a wrapper class 10 months ago
  Nel Nibcord bf33ffde87 This doesn't need to be a tuple really 10 months ago
  Nel Nibcord 10e9f44a10 one-line output buffering 10 months ago
  Nel Nibcord 52ef6ee4a3 Made temperature and top_p available to the inference engine sample interfaces 10 months ago
  Nel Nibcord 8205a5aebc Implemented per-request caching in tinygrad 10 months ago
  Nel Nibcord 13572e6a40 Some stability improvements for tinygrad inference 10 months ago
  Nel Nibcord aefc0d7c51 I think this is more faithful to how it was originally done 10 months ago
  Nel Nibcord c06b5f3b56 Corrected type annotations 10 months ago
  Nel Nibcord 9b66758b59 Make sure they're np arrays 10 months ago
  Nel Nibcord b9d0fb6825 Since infer_prompt is a thin wrapper that works the same for all inference engines, we can de-abstract it 10 months ago
  Nel Nibcord 527c7a6e49 Applied new interface to tinygrad and dummy inference engines 10 months ago
  Nel Nibcord 52b91de817 Changed model classname due to the sharding being done elsewhere 10 months ago
  Nel Nibcord 34019e4608 Forgot an abstractmethod 10 months ago
  Nel Nibcord 82cce4408e Some initial inference engine refactors for enabling training 10 months ago
  Alex Cheema 4713bc5acd Merge pull request #431 from exo-explore/qwen32b 10 months ago
  Alex Cheema e9ba815c21 add qwen2.5 coder 3b,14b,32b 10 months ago
  Alex Cheema a0b6adad85 Merge pull request #430 from austinbv/patch-1 10 months ago
  Austin 5435671cd9 Add 32b Qwen 2.5 10 months ago
  Alex Cheema 526f8a7ad5 Merge pull request #429 from exo-explore/readme_hf_home 10 months ago
  Alex Cheema 167e756b31 add documentation of HF_HOME model storage location in README. fixes #427 10 months ago
  Alex Cheema b41b7d778a Merge pull request #426 from exo-explore/tinygrad_ci_test 10 months ago
  Alex Cheema 9e4366f36b tinygrad ci 10 months ago
  Alex Cheema 6cd78b94d4 run tinygrad test with CLANG=1 10 months ago
  Alex Cheema 49c4394dfa enable tinygrad test 10 months ago
  Alex Cheema 77d78935b7 remove redundant expected_content 10 months ago
  Alex Cheema 8cc3f51e79 test for tinygrad e2e 10 months ago
  Alex Cheema 858421a3a7 Merge pull request #418 from BatSmacker84/llama-3.2-support 10 months ago
  Alex Cheema 832a860b34 Merge pull request #424 from exo-explore/llama405b-8bit 10 months ago
  Alex Cheema 472359147d ignore 8bit llama 405b from tokenizers test 10 months ago
  Alex Cheema 49833e1fde Merge pull request #423 from exo-explore/llama405b-8bit 10 months ago