提交历史

作者 SHA1 备注 提交日期
  Alex Cheema 6c1bf127b3 add --max-parallel-downloads flag that limits the number of downloads at a time with asyncio.semaphore 8 月之前
  Alex Cheema e6902b2fcf add --download-quick-check flag to bypass the hf api calls / remote file checks 8 月之前
  Alex Cheema 71591d2ebc display all interfaces web chat and chatgpt api are available on fixes #134 8 月之前
  Alex Cheema 6bddb2a9dc download edge cases 9 月之前
  Alex Cheema f29963f41e preemptively start downloads when any node starts processing a prompt. this fixes #104 9 月之前
  Alex Cheema 476a714bbb make a separate ShardDownloader abstract class w HFShardDownloader. this opens up plugging in different methods of downloading model shards e.g. #79 / #16 9 月之前
  Alex Cheema d22ed12e7b bring tinygrad to parity with mlx on llama models, show progress of each download file 9 月之前
  Alex Cheema 545a486ed3 separate hf_helpers, make extra dir with download_hf script, unify downloading so tinygrad uses the same method as mlx and interoperable model formats 9 月之前
  Alex Cheema 0bfb8e3b6d sticky node ids #16 9 月之前
  Alex Cheema d6a7e46324 async model downloading with download progress. fixes #102. related: #16 #104 9 月之前
  Alex Cheema 57b2f2a4e2 fix ruff lint errors 9 月之前
  Alex Cheema 9a373c2bb0 make configurable discovery timeout 9 月之前
  Alex Cheema 63a05d5b4f make configurable discovery timeout 9 月之前
  Alex Cheema 174cff071e Merge pull request #58 from jakobdylanc/main 9 月之前
  Alex Cheema b0e7dd9d2d add max-generate-tokens flag fixes #54 9 月之前
  JakobDylanC f2f61ccee6 inference engine selection improvements 9 月之前
  Alex Cheema 4e46232364 add simple prometheus metrics collection, with a prometheus / grafana instance for live dashboard. related: #22 9 月之前
  Alex Cheema 2e419ba211 Merge pull request #48 from itsknk/intel-mac 9 月之前
  itsknk e934664168 implement dynamic inference engine selection 9 月之前
  Alec Potluri db583a863f disable tui flag 9 月之前
  Alex Cheema e49924e1b9 add chatgpt-api-response-timeout-secs flag, set this to 20 mins in test 9 月之前
  Alex Cheema a342e1abd8 add web url and chatgpt api endpoint to panel (fixes #43), fix a rounding error in the partition to shard mapping implementation 9 月之前
  Alex Cheema d9484906a3 remove the spammy logs 9 月之前
  Alex Cheema 4b592f9d45 exo topology visualisation that shows the topology of the network, device capabilities and the currently active node using opaque statuses. fixes #36. ready for #33 9 月之前
  Alex Cheema 35177690bd by default find an ephemeral node port fixes #35, more robust topology updates. both fix #15 and #14 9 月之前
  Alex Cheema 945f90f676 allow overriding inference_engine and separate flag for TINYGRAD_DEBUG 9 月之前
  Alex Cheema 72fe293729 exo text on start and stop 9 月之前
  Alex Cheema 1e1e11cdc6 check if inference_engine has tokenizer before printing with it 9 月之前
  Alex Cheema eb92da2c3e cleaner chatgpt api impl with async callbacks 9 月之前
  Alex Cheema 71e00745cc fix tokenizer inconsistencies 9 月之前