Alex Cheema
|
1d48192094
ci test
|
10 months ago |
Alex Cheema
|
ad695696a5
run on every commit on main, reuqire approval on other branches
|
10 months ago |
Alex Cheema
|
710e5a31e7
TODO for why use_fast=False is giving inconsistent behaviour (no spaces decoding invididual tokens) for Mistral-Large-Instruct-2407-4bit
|
10 months ago |
Alex Cheema
|
e17e5f9a41
tests for tokenizers. unfortunately use_fast=False and use_fast=True give different behaviour
|
10 months ago |
Alex Cheema
|
0d218e244e
use fast AutoProcessor fixes #164 tokenizer issues with mistral-large.
|
10 months ago |
Alex Cheema
|
23ae5e92c5
hold circleci tests for approval on non-main branches
|
10 months ago |
Alex Cheema
|
d54944f4ca
stream outputs from chatgpt api integration test
|
10 months ago |
Alex Cheema
|
1133e27ad3
Merge pull request #166 from exo-explore/formatting
|
10 months ago |
Alex Cheema
|
f53056dede
more compact operator formatting
|
10 months ago |
Alex Cheema
|
14f2846a9c
yapf set blank_line_before_nested_class_or_def to false
|
10 months ago |
Alex Cheema
|
ea70c9fb76
reformat with yapf format.py
|
10 months ago |
Alex Cheema
|
2e27076665
simplify formatting with yapf
|
10 months ago |
Alex Cheema
|
417114fae4
fix mistral nemo
|
10 months ago |
Alex Cheema
|
5101f03369
keep 4 in RotatingKVCache
|
10 months ago |
Alex Cheema
|
6db73fab1c
laptop gpu device capabilites
|
10 months ago |
Alex Cheema
|
647ffb94eb
increase cli generation timeout
|
10 months ago |
Alex Cheema
|
dd24e7db1e
only ignore CancelledError inside stop
|
10 months ago |
Alex Cheema
|
2e1233357c
ignore CancelledError when stopping the server
|
10 months ago |
Alex Cheema
|
dfa3fdcf08
Merge pull request #161 from exo-explore/cli
|
10 months ago |
Alex Cheema
|
ae35ada19b
fix headless mode with --disable-tui
|
10 months ago |
Alex Cheema
|
b95916e0b5
show prompts and outputs in tui
|
10 months ago |
Alex Cheema
|
e84304317c
add a cli that can be triggered with --run-model <model> --prompt <prompt>
|
10 months ago |
Alex Cheema
|
cea9b48d24
update mlx-lm to 0.17.0, use lru caches for kv_cache with RotatingKVCache to optimise memory fixes #158
|
10 months ago |
Alex Cheema
|
9c645b14f1
Merge pull request #157 from exo-explore/astra
|
10 months ago |
Alex Cheema
|
430d4c0cf8
astra clarify readme, it's an example app
|
10 months ago |
Alex Cheema
|
e87e7260f8
astra: live camera with overlay debug info / ui
|
10 months ago |
Alex Cheema
|
1a419f1f00
astra better ui with camera vlm
|
10 months ago |
Alex Cheema
|
8503543894
fix streaming, change default model to llava
|
10 months ago |
Alex Cheema
|
c94ffa0e2b
fix audio buffering
|
10 months ago |
Alex Cheema
|
23c713c012
better readme for astra
|
10 months ago |