lqb/exo

Author	SHA1	Message	Date
Alex Cheema	5c67e24c35	smart prompt longest prefix matching to avoid sending the same text through the NN again. speeds up prefill significantly	1 year ago
Alex Cheema	94ac9463a7	fix model id for llama 3.1 405b now its finally on the hub	1 year ago
Alex Cheema	178fb75c84	fix image api prompt encoding	1 year ago
Alex Cheema	2d20000964	use AutoProcessor with use_fast=False since there's a bug with use_fast=True where whitespace is removed on single token decodes	1 year ago
Alex Cheema	0ec77e1a99	Merge pull request #88 from varshith15/main	1 year ago
Alex Cheema	af1c7ce327	add support for image upload to tinychat for vision models	1 year ago
Alex Cheema	0d45a855fb	increase max request size to send raw images, make image download from url async, use chatgpt-compatible convention for images	1 year ago
Alex Cheema	e68d06f4ef	move model-selector styles to index.css	1 year ago
Alex Cheema	78db451d7e	add pillow to main dependencies	1 year ago
Alex Cheema	824f05263f	Merge branch 'main' into HEAD	1 year ago
Alex Cheema	142682645f	bump up tinygrad version	1 year ago
Varshith	8d3d3df1dd	update readme	1 year ago
Varshith	acc94b50c7	chatgpt api integration	1 year ago
Alex Cheema	33cbacf513	fix llava sanitize	1 year ago
Alex Cheema	2fb961fccd	stick to same convention as new llama	1 year ago
Alex Cheema	b44b917151	add pillow as testing dependency	1 year ago
Alex Cheema	2aa1e24ea9	remove unused torch import	1 year ago
Alex Cheema	833e7f3396	rename sharded_llava -> llava to match new convention	1 year ago
Alex Cheema	7d5eed1111	Merge branch 'main' into HEAD	1 year ago
Alex Cheema	044d189ccc	Merge pull request #94 from mzbac/mlx_refactor	1 year ago
Alex Cheema	909d5ef8ba	Merge branch 'main' into mlx_refactor	1 year ago
Alex Cheema	63e51a8270	formatting	1 year ago
Alex Cheema	6695b019a2	format format.py	1 year ago
Alex Cheema	1dc08fecaa	increase max line length to 200	1 year ago
Alex Cheema	444137776a	formatting	1 year ago
Anchen	a6bb8ddf41	update deepseek sanitize to shard layers first before handle switch	1 year ago
Alex Cheema	cb217b7b77	format format.py	1 year ago
Alex Cheema	4cb36a7f55	increase max line length to 200	1 year ago
Alex Cheema	d94e3f9ce4	formatting	1 year ago
Anchen	666b1c83ee	refactor(mlx): model sharding and add deepseek v2 support	1 year ago
