Picture for Yi Xin

Yi Xin

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Add code
Dec 25, 2025
Viaarxiv icon

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Figure 1 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 2 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 3 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 4 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Viaarxiv icon

From Masks to Worlds: A Hitchhiker's Guide to World Models

Add code
Oct 23, 2025
Viaarxiv icon

LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation

Add code
Aug 06, 2025
Viaarxiv icon

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Add code
Jul 30, 2025
Viaarxiv icon

Low-Cost Test-Time Adaptation for Robust Video Editing

Add code
Jul 29, 2025
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Figure 1 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 2 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 3 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 4 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Viaarxiv icon

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Add code
Jul 17, 2025
Figure 1 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 2 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 3 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 4 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Viaarxiv icon

Partitioner Guided Modal Learning Framework

Add code
Jul 15, 2025
Viaarxiv icon

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Add code
May 26, 2025
Viaarxiv icon