Qihang Yu

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Apr 06, 2026

Autoregressive Image Generation with Masked Bit Modeling

Feb 09, 2026

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Jan 29, 2026

ThinkRec: Thinking-based recommendation via LLM

May 21, 2025

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

May 20, 2025

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Apr 30, 2025

FlowTok: Flowing Seamlessly Across Text and Image Tokens

Mar 13, 2025

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Feb 27, 2025

Dictionary-based Framework for Interpretable and Consistent Object Parsing

Feb 26, 2025

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Feb 04, 2025