Picture for Xueqing Deng

Xueqing Deng

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Add code
Apr 14, 2025
Viaarxiv icon

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Add code
Feb 04, 2025
Viaarxiv icon

1.58-bit FLUX

Add code
Dec 24, 2024
Viaarxiv icon

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Add code
Dec 12, 2024
Viaarxiv icon

Randomized Autoregressive Visual Generation

Add code
Nov 01, 2024
Viaarxiv icon

MaskBit: Embedding-free Image Generation via Bit Tokens

Add code
Sep 24, 2024
Figure 1 for MaskBit: Embedding-free Image Generation via Bit Tokens
Figure 2 for MaskBit: Embedding-free Image Generation via Bit Tokens
Figure 3 for MaskBit: Embedding-free Image Generation via Bit Tokens
Figure 4 for MaskBit: Embedding-free Image Generation via Bit Tokens
Viaarxiv icon

An Image is Worth 32 Tokens for Reconstruction and Generation

Add code
Jun 11, 2024
Figure 1 for An Image is Worth 32 Tokens for Reconstruction and Generation
Figure 2 for An Image is Worth 32 Tokens for Reconstruction and Generation
Figure 3 for An Image is Worth 32 Tokens for Reconstruction and Generation
Figure 4 for An Image is Worth 32 Tokens for Reconstruction and Generation
Viaarxiv icon

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Add code
Apr 16, 2024
Viaarxiv icon

COCONut: Modernizing COCO Segmentation

Add code
Apr 12, 2024
Viaarxiv icon

MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Add code
Nov 30, 2023
Viaarxiv icon