Much more info here: https://llm-tracker.info/_TOORG/Strix-Halo
+
#### Additional Resources
+
- Deep dive into LLM usage on Strix Halo: https://llm-tracker.info/_TOORG/Strix-Halo
+
- Newbie Linux inference guide: https://github.com/renaudrenaud/local_inference
### Image/Video Generation
Didn't play with it too much yet, but looks like here the memory bandwidth and GPU performance limitations strike the most. With SDXL you can generate an image every 4-5 seconds, but going to something like Flux will lead to wait times of several minutes.