Picture for Yufan Zhou

Yufan Zhou

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Add code
Jul 08, 2025
Viaarxiv icon

Towards Visual Text Grounding of Multimodal Large Language Model

Add code
Apr 07, 2025
Viaarxiv icon

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Add code
Feb 22, 2025
Figure 1 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 2 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 3 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 4 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Viaarxiv icon

FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion

Add code
Feb 08, 2025
Viaarxiv icon

Scattering Environment Aware Joint Multi-user Channel Estimation and Localization with Spatially Reused Pilots

Add code
Jan 04, 2025
Viaarxiv icon

A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation

Add code
Dec 20, 2024
Viaarxiv icon

Numerical Pruning for Efficient Autoregressive Models

Add code
Dec 17, 2024
Figure 1 for Numerical Pruning for Efficient Autoregressive Models
Figure 2 for Numerical Pruning for Efficient Autoregressive Models
Figure 3 for Numerical Pruning for Efficient Autoregressive Models
Figure 4 for Numerical Pruning for Efficient Autoregressive Models
Viaarxiv icon

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Add code
Dec 13, 2024
Viaarxiv icon

TTVD: Towards a Geometric Framework for Test-Time Adaptation Based on Voronoi Diagram

Add code
Dec 10, 2024
Viaarxiv icon

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Add code
Nov 02, 2024
Viaarxiv icon