Picture for Siqi Luo

Siqi Luo

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Add code
Dec 25, 2025
Viaarxiv icon

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Viaarxiv icon

LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation

Add code
Aug 06, 2025
Viaarxiv icon

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Add code
Jul 30, 2025
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Figure 1 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 2 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 3 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 4 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Viaarxiv icon

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Add code
Jul 17, 2025
Figure 1 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 2 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 3 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 4 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Viaarxiv icon

Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

Add code
Sep 17, 2024
Figure 1 for Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending
Figure 2 for Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending
Figure 3 for Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending
Figure 4 for Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending
Viaarxiv icon

Enhancing Test Time Adaptation with Few-shot Guidance

Add code
Sep 02, 2024
Figure 1 for Enhancing Test Time Adaptation with Few-shot Guidance
Figure 2 for Enhancing Test Time Adaptation with Few-shot Guidance
Figure 3 for Enhancing Test Time Adaptation with Few-shot Guidance
Figure 4 for Enhancing Test Time Adaptation with Few-shot Guidance
Viaarxiv icon

D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Add code
Jun 18, 2024
Figure 1 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 2 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 3 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 4 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Viaarxiv icon

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey

Add code
Feb 08, 2024
Viaarxiv icon