Picture for Xin Yan

Xin Yan

STaR-Quant: State-Time Consistent Post-Training Quantization for Diffusion Large Language Models

Add code
Jun 03, 2026
Viaarxiv icon

R3-VAE: Reference Vector-Guided Rating Residual Quantization VAE for Generative Recommendation

Add code
Apr 14, 2026
Viaarxiv icon

AutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure Design

Add code
Mar 28, 2026
Viaarxiv icon

HBVLA: Pushing 1-Bit Post-Training Quantization for Vision-Language-Action Models

Add code
Feb 14, 2026
Viaarxiv icon

MindChat: A Privacy-preserving Large Language Model for Mental Health Support

Add code
Jan 05, 2026
Viaarxiv icon

ReMA: A Training-Free Plug-and-Play Mixing Augmentation for Video Behavior Recognition

Add code
Jan 01, 2026
Viaarxiv icon

Disentangling Emotional Bases and Transient Fluctuations: A Low-Rank Sparse Decomposition Approach for Video Affective Analysis

Add code
Nov 14, 2025
Viaarxiv icon

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Add code
Apr 09, 2025
Figure 1 for Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Figure 2 for Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Figure 3 for Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Figure 4 for Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Viaarxiv icon

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation

Add code
Dec 02, 2024
Figure 1 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 2 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 3 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 4 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Viaarxiv icon

RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text

Add code
May 30, 2024
Figure 1 for RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Figure 2 for RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Figure 3 for RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Figure 4 for RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Viaarxiv icon