lqb/exo

Author	SHA1 Message	Date
Alex Cheema	84afdbcbe8 add --download-quick-check flag to bypass the hf api calls / remote file checks	10 months ago
Alex Cheema	1f736dacd7 Merge pull request #135 from exo-explore/exo_interfaces	10 months ago
Alex Cheema	71591d2ebc display all interfaces web chat and chatgpt api are available on fixes #134	10 months ago
Alex Cheema	3bd5a116df ignore files that dont match allow patterns	10 months ago
Alex Cheema	3db3e8294c make download panel slightly larger	10 months ago
Alex Cheema	9e78c42b4b Merge pull request #124 from exo-explore/refactor_model_download	10 months ago
Alex Cheema	5112f53a37 trigger ci	10 months ago
Alex Cheema	047ef48c1d use separate hf cache dirs for chatgpt api integration test. its an unusual setup where we're running 2 exo instances on the same device which share a disk and hf cache	10 months ago
Alex Cheema	2be446546f refactor tinygrad, only load necessary layers for each shard fixes #128, enable JIT (much faster), prefill all layers not just the first shard fixes #12, use new ShardDownloader for more robust, parallel downloads	10 months ago
Alex Cheema	357331c55f remove some logs, make get_allow_patterns out of class	10 months ago
Alex Cheema	b1eb05ed47 debug level 7 for tests	10 months ago
Alex Cheema	09a9abc065 fix inference engine test	10 months ago
Alex Cheema	67c269076b Merge branch 'main' into refactor_model_download	10 months ago
Alex Cheema	dd41026c5b cache completed download paths	10 months ago
Alex Cheema	706488732f disable prefix matching on prompts. causes subsequent requests to fail with cannot be broadcast. hotfix for #130	10 months ago
Alex Cheema	35b7042e70 upgrade mlx to 0.16.1	10 months ago
Alex Cheema	b181f8aa82 handle writing responses errors	10 months ago
Alex Cheema	7ec660bba6 fix shard download	10 months ago
Alex Cheema	29f154597c init active_downloadsa	10 months ago
Alex Cheema	6bddb2a9dc download edge cases	10 months ago
Alex Cheema	f29963f41e preemptively start downloads when any node starts processing a prompt. this fixes #104	10 months ago
Alex Cheema	7a65a96e52 download progress styling	10 months ago
Alex Cheema	c59ceab821 viz spacing	10 months ago
Alex Cheema	0a588d0443 viz styles	10 months ago
Alex Cheema	d9f232b313 cleaner download progress ui	10 months ago
Alex Cheema	476a714bbb make a separate ShardDownloader abstract class w HFShardDownloader. this opens up plugging in different methods of downloading model shards e.g. #79 / #16	10 months ago
Alex Cheema	d22ed12e7b bring tinygrad to parity with mlx on llama models, show progress of each download file	10 months ago
Alex Cheema	45142dab26 tests	10 months ago
Alex Cheema	545a486ed3 separate hf_helpers, make extra dir with download_hf script, unify downloading so tinygrad uses the same method as mlx and interoperable model formats	10 months ago
Alex Cheema	9014efae86 minimal script to download from hf async with progress	10 months ago

Newer Older

Commit History Find

Commit History