Articles tagged: "diffusion-transformers"
-
Training a Portrait DiT on a Single GPU: What the Ablation Study Taught Us
The prevailing assumption in generative AI is that training a large, multi-modal Diffusion Transformer from scratch requires a cluster. prx-tg is a direct challenge to that assumption: a 400M+ parameter... -
Eradicating Syntax: Building a Neural Universal Machine That Executes Graphs, Not Code
Our GNN autoencoders achieved 81% node accuracy on Ruby ASTs yet produced 0% valid code. The culprit was the literal value bottleneck — nearly half of every AST consisted of...