Articles tagged: "torchao"
-
Training a Pixel-Space DiT in 26 Hours: FP8 Breakthroughs and Architectural Dead Ends
Following our integration of Asymmetric Flow Matching, our 400M parameter NanoDiT was training efficiently in terms of step-count convergence, but it was hitting an iter_per_sec of 0.025 on our single...