Nel Nibcord
|
52ef6ee4a3
Made temperature and top_p available to the inference engine sample interfaces
|
8 сар өмнө |
Nel Nibcord
|
8205a5aebc
Implemented per-request caching in tinygrad
|
8 сар өмнө |
Nel Nibcord
|
13572e6a40
Some stability improvements for tinygrad inference
|
8 сар өмнө |
Nel Nibcord
|
aefc0d7c51
I think this is more faithful to how it was originally done
|
8 сар өмнө |
Nel Nibcord
|
c06b5f3b56
Corrected type annotations
|
8 сар өмнө |
Nel Nibcord
|
9b66758b59
Make sure they're np arrays
|
8 сар өмнө |
Nel Nibcord
|
b9d0fb6825
Since infer_prompt is a thin wrapper that works the same for all inference engines, we can de-abstract it
|
8 сар өмнө |
Nel Nibcord
|
527c7a6e49
Applied new interface to tinygrad and dummy inference engines
|
8 сар өмнө |
Nel Nibcord
|
52b91de817
Changed model classname due to the sharding being done elsewhere
|
8 сар өмнө |
Nel Nibcord
|
34019e4608
Forgot an abstractmethod
|
8 сар өмнө |
Nel Nibcord
|
82cce4408e
Some initial inference engine refactors for enabling training
|
8 сар өмнө |
Alex Cheema
|
4713bc5acd
Merge pull request #431 from exo-explore/qwen32b
|
8 сар өмнө |
Alex Cheema
|
e9ba815c21
add qwen2.5 coder 3b,14b,32b
|
8 сар өмнө |
Alex Cheema
|
a0b6adad85
Merge pull request #430 from austinbv/patch-1
|
8 сар өмнө |
Austin
|
5435671cd9
Add 32b Qwen 2.5
|
8 сар өмнө |
Alex Cheema
|
526f8a7ad5
Merge pull request #429 from exo-explore/readme_hf_home
|
8 сар өмнө |
Alex Cheema
|
167e756b31
add documentation of HF_HOME model storage location in README. fixes #427
|
8 сар өмнө |
Alex Cheema
|
b41b7d778a
Merge pull request #426 from exo-explore/tinygrad_ci_test
|
8 сар өмнө |
Alex Cheema
|
9e4366f36b
tinygrad ci
|
8 сар өмнө |
Alex Cheema
|
6cd78b94d4
run tinygrad test with CLANG=1
|
8 сар өмнө |
Alex Cheema
|
49c4394dfa
enable tinygrad test
|
8 сар өмнө |
Alex Cheema
|
77d78935b7
remove redundant expected_content
|
8 сар өмнө |
Alex Cheema
|
8cc3f51e79
test for tinygrad e2e
|
8 сар өмнө |
Alex Cheema
|
858421a3a7
Merge pull request #418 from BatSmacker84/llama-3.2-support
|
8 сар өмнө |
Alex Cheema
|
832a860b34
Merge pull request #424 from exo-explore/llama405b-8bit
|
8 сар өмнө |
Alex Cheema
|
472359147d
ignore 8bit llama 405b from tokenizers test
|
8 сар өмнө |
Alex Cheema
|
49833e1fde
Merge pull request #423 from exo-explore/llama405b-8bit
|
8 сар өмнө |
Alex Cheema
|
98948441e3
add llama 3.1 405b 8bit at mlx-community/Meta-Llama-3.1-405B-Instruct-8bit
|
8 сар өмнө |
Alex Cheema
|
5133885095
Merge pull request #422 from samiamjidkhan/history-button
|
8 сар өмнө |
Sami Khan
|
0d8a1ee41e
Added clear all history button
|
9 сар өмнө |