Picture for Xiaofeng Zhang

Xiaofeng Zhang

Diffusion-CAM: Faithful Visual Explanations for dMLLMs

Add code
Apr 13, 2026
Viaarxiv icon

Reasoning Fails Where Step Flow Breaks

Add code
Apr 08, 2026
Viaarxiv icon

ART: Attention Replacement Technique to Improve Factuality in LLMs

Add code
Apr 07, 2026
Viaarxiv icon

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

Add code
Feb 26, 2026
Viaarxiv icon

C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

Add code
Feb 16, 2026
Viaarxiv icon

Hallucination Begins Where Saliency Drops

Add code
Jan 28, 2026
Viaarxiv icon

Context Tokens are Anchors: Understanding the Repetition Curse in dMLLMs from an Information Flow Perspective

Add code
Jan 28, 2026
Viaarxiv icon

Inference-time Physics Alignment of Video Generative Models with Latent World Models

Add code
Jan 15, 2026
Viaarxiv icon

Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual Exposure

Add code
Dec 19, 2025
Figure 1 for Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual Exposure
Figure 2 for Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual Exposure
Figure 3 for Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual Exposure
Viaarxiv icon

D$^{3}$ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs

Add code
Nov 15, 2025
Viaarxiv icon