Qiyan Zhao

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

Feb 26, 2026
Via arXiv

C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

Feb 16, 2026
Via arXiv

Context Tokens are Anchors: Understanding the Repetition Curse in dMLLMs from an Information Flow Perspective

Jan 28, 2026
Via arXiv

Hallucination Begins Where Saliency Drops

Jan 28, 2026
Via arXiv