Alex Cheema 9c1bea97e8 fix embed_tokens for last layer in qwen models 3 miesięcy temu
..
models 9c1bea97e8 fix embed_tokens for last layer in qwen models 3 miesięcy temu
__init__.py 5bbde22a23 move everything under exo module 9 miesięcy temu
losses.py 38e368f00b Fixed up the ops so that batches work 4 miesięcy temu
perf_improvements.md c9ded9ba96 optimise networking, remove bloat 4 miesięcy temu
sharded_inference_engine.py af171f06fa propagate prompts to other nodes so they can display them, cleaner prompt/output output 3 miesięcy temu
sharded_utils.py 9986fb86d4 remove prints and fix download progress for SD 4 miesięcy temu
test_non_blocking.py 3c7bd48aa3 get rid of some more hf bloat 3 miesięcy temu
test_sharded_model.py f53056dede more compact operator formatting 8 miesięcy temu