@Modular
You shouldn't have to choose between peak GPU performance and code you can actually maintain. We built Structured Mojo 🔥 Kernels to fix that. Performance, usability, and portability without the tradeoff. 14k to 7k lines. ~1.8k TFLOPS held. We wrote a 4-part series on how. Part 1 is up https://t.co/zMYWMfDOb2