Commit History

Author SHA1 Message Date
  Rory Clear 3384fc7294 update tinygrad version 5 months ago
  Nel Nibcord 8b71d57da7 Removed inference state entirely 5 months ago
  Nel Nibcord 65fdc99ccc Call no longer needs request_id 5 months ago
  Nel Nibcord 90518a3bbe Hoisted caching to a wrapper class 5 months ago
  Nel Nibcord 8205a5aebc Implemented per-request caching in tinygrad 5 months ago
  Nel Nibcord 13572e6a40 Some stability improvements for tinygrad inference 5 months ago
  Nel Nibcord 527c7a6e49 Applied new interface to tinygrad and dummy inference engines 5 months ago
  Ogden Wells fbec1d2b10 formatted changes 5 months ago
  Ogden Wells af01b23a07 added rope_scaling and tie_word_embeddings to llama transformer 5 months ago
  Alex Cheema f53056dede more compact operator formatting 8 months ago
  Alex Cheema 14f2846a9c yapf set blank_line_before_nested_class_or_def to false 8 months ago
  Alex Cheema ea70c9fb76 reformat with yapf format.py 8 months ago
  Alex Cheema 803dffd1c4 always call convert_from_huggingface with tinygrad models. this was broken by shard layer filtering which made the check sometimes fail. fixes #144 8 months ago
  Alex Cheema 2be446546f refactor tinygrad, only load necessary layers for each shard fixes #128, enable JIT (much faster), prefill all layers not just the first shard fixes #12, use new ShardDownloader for more robust, parallel downloads 9 months ago
  Alex Cheema 55bcad98e3 standardise tinygrad models/tokenizers so it can handle mlx hf 9 months ago
  Alex Cheema 4cb36a7f55 increase max line length to 200 9 months ago
  Alex Cheema ce761038ac formatting / linting 9 months ago
  Alex Cheema 46d618abed tiny fixes 9 months ago
  Alex Cheema dd8d18128c add an opaque inference_state that inference engines can use to pass around small state to other devices 9 months ago
  Alex Cheema 5bbde22a23 move everything under exo module 9 months ago