Unified Latent Space
A single DiT architecture lets Doubao both understand and generate content, with 92% visual-textual alignment.
