HACKOBAR_item
[arXiv]score: 0.47

HiDream-O1-Image: A Natively Unified Image Generative Foundation Model with Pixel-level Unified Transformer

May 13, 2026
HiDream-O1-Image presents end-to-end pixel-space Diffusion Transformer unifying image generation, text encoding, and task conditioning into single token space, eliminating external VAE and text encoder dependencies.
cs.CVcs.MM