Rongchao Zhang

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Dec 22, 2025
Via arXiv

PMMD: A pose-guided multi-view multi-modal diffusion for person generation

Dec 17, 2025
Via arXiv

Low-Cost Test-Time Adaptation for Robust Video Editing

Jul 29, 2025
Via arXiv