lqb/exo

Author	SHA1	Message	Date
Alex Cheema	142682645f bump up tinygrad version	1 year ago
Alex Cheema	044d189ccc Merge pull request #94 from mzbac/mlx_refactor	1 year ago
Alex Cheema	909d5ef8ba Merge branch 'main' into mlx_refactor	1 year ago
Alex Cheema	63e51a8270 formatting	1 year ago
Alex Cheema	6695b019a2 format format.py	1 year ago
Alex Cheema	1dc08fecaa increase max line length to 200	1 year ago
Alex Cheema	444137776a formatting	1 year ago
Anchen	a6bb8ddf41 update deepseek sanitize to shard layers first before handle switch	1 year ago
Alex Cheema	cb217b7b77 format format.py	1 year ago
Alex Cheema	4cb36a7f55 increase max line length to 200	1 year ago
Alex Cheema	d94e3f9ce4 formatting	1 year ago
Anchen	666b1c83ee refactor(mlx): model sharding and add deepseek v2 support	1 year ago
Alex Cheema	931ced7c01 fix a few more linter errors	1 year ago
Alex Cheema	57b2f2a4e2 fix ruff lint errors	1 year ago
Alex Cheema	ce761038ac formatting / linting	1 year ago
Alex Cheema	f1bd5fe152 Merge pull request #90 from xeb/main	1 year ago
Alex Cheema	f051ebe6e0 remove accidentally added files	1 year ago
Mark Kockerbeck	5eafd5a305 try/except for decode, #75	1 year ago
Alex Cheema	faa1319470 disable chatgpt api integration test, github changed something in their mac runners? perhaps time to switch over to circleci like mlx	1 year ago
Alex Cheema	67a1aaa823 check processes in github workflow	1 year ago
Alex Cheema	9a3ac273a9 Merge pull request #77 from Cloud1590/main	1 year ago
Alex Cheema	628d8679b0 force mlx inference engine in github workflow, where it defaults to tinygrad because it's running on 'model': 'Apple Virtual Machine 1', 'chip': 'Apple M1 (Virtual)'	1 year ago
Alex Cheema	e856d7f7f9 log chatgpt integration test output from each process on github workflow failure	1 year ago
Mark Kockerbeck	d2fa7b247e Showing the message only if successfully decoded, #75	1 year ago
Mark Kockerbeck	f1cd5ae7a6 Merge branch 'main' of github.com:xeb/exo	1 year ago
Mark Kockerbeck	4f5ab78d9d Addressing issue #75 to avoid decoding binary packets	1 year ago
Alex Cheema	5a23376059 add log_request middleware if DEBUG>=2 to chatgpt api to debug api issues, default always to llama-3.1-8b	1 year ago
Alex Cheema	2084784470 per-request kv cache, remove all explicit reset functionality as it wasnt used. fixes #67	1 year ago
Alex Cheema	dd8c5d63a9 add support for mistral nemo and mistral large	1 year ago
Alex Cheema	03fe7a058c more robust message parsing fixes #81	1 year ago
