Alex Cheema
|
1f9d16ec78
run tokenizers test in ci, run all models available
|
11 months ago |
Alex Cheema
|
6243846eeb
ci logs
|
1 year ago |
Alex Cheema
|
cfe980bdaa
simplify ci
|
1 year ago |
Alex Cheema
|
9513c4fd17
ci tail log files
|
1 year ago |
Alex Cheema
|
7a02acdcd5
fix ci output streaming
|
1 year ago |
Alex Cheema
|
ad695696a5
run on every commit on main, reuqire approval on other branches
|
1 year ago |
Alex Cheema
|
710e5a31e7
TODO for why use_fast=False is giving inconsistent behaviour (no spaces decoding invididual tokens) for Mistral-Large-Instruct-2407-4bit
|
1 year ago |
Alex Cheema
|
e17e5f9a41
tests for tokenizers. unfortunately use_fast=False and use_fast=True give different behaviour
|
1 year ago |
Alex Cheema
|
0d218e244e
use fast AutoProcessor fixes #164 tokenizer issues with mistral-large.
|
1 year ago |
Alex Cheema
|
23ae5e92c5
hold circleci tests for approval on non-main branches
|
1 year ago |
Alex Cheema
|
d54944f4ca
stream outputs from chatgpt api integration test
|
1 year ago |
Alex Cheema
|
1133e27ad3
Merge pull request #166 from exo-explore/formatting
|
1 year ago |
Alex Cheema
|
f53056dede
more compact operator formatting
|
1 year ago |
Alex Cheema
|
14f2846a9c
yapf set blank_line_before_nested_class_or_def to false
|
1 year ago |
Alex Cheema
|
ea70c9fb76
reformat with yapf format.py
|
1 year ago |
Alex Cheema
|
2e27076665
simplify formatting with yapf
|
1 year ago |
Alex Cheema
|
417114fae4
fix mistral nemo
|
1 year ago |
Alex Cheema
|
5101f03369
keep 4 in RotatingKVCache
|
1 year ago |
Alex Cheema
|
6db73fab1c
laptop gpu device capabilites
|
1 year ago |
Alex Cheema
|
647ffb94eb
increase cli generation timeout
|
1 year ago |
Alex Cheema
|
dd24e7db1e
only ignore CancelledError inside stop
|
1 year ago |
Alex Cheema
|
2e1233357c
ignore CancelledError when stopping the server
|
1 year ago |
Alex Cheema
|
dfa3fdcf08
Merge pull request #161 from exo-explore/cli
|
1 year ago |
Alex Cheema
|
ae35ada19b
fix headless mode with --disable-tui
|
1 year ago |
Alex Cheema
|
b95916e0b5
show prompts and outputs in tui
|
1 year ago |
Alex Cheema
|
e84304317c
add a cli that can be triggered with --run-model <model> --prompt <prompt>
|
1 year ago |
Alex Cheema
|
cea9b48d24
update mlx-lm to 0.17.0, use lru caches for kv_cache with RotatingKVCache to optimise memory fixes #158
|
1 year ago |
Alex Cheema
|
9c645b14f1
Merge pull request #157 from exo-explore/astra
|
1 year ago |
Alex Cheema
|
430d4c0cf8
astra clarify readme, it's an example app
|
1 year ago |
Alex Cheema
|
e87e7260f8
astra: live camera with overlay debug info / ui
|
1 year ago |