Alex Cheema 9c1bea97e8 fix embed_tokens for last layer in qwen models 3 months ago
models                       9c1bea97e8  fix embed_tokens for last layer in qwen models  3 months ago
__init__.py                  5bbde22a23  move everything under exo module  9 months ago
losses.py                    38e368f00b  Fixed up the ops so that batches work  4 months ago
perf_improvements.md         c9ded9ba96  optimise networking, remove bloat  4 months ago
sharded_inference_engine.py  af171f06fa  propagate prompts to other nodes so they can display them, cleaner prompt/output output  3 months ago
sharded_utils.py             9986fb86d4  remove prints and fix download progress for SD  4 months ago
test_non_blocking.py         3c7bd48aa3  get rid of some more hf bloat  3 months ago
test_sharded_model.py        f53056dede  more compact operator formatting  8 months ago