Peijin Xie

M$^3$-ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering

Mar 09, 2026
Via arXiv

TextlessRAG: End-to-End Visual Document RAG by Speech Without Text

Sep 10, 2025
Via arXiv

Expand VSR Benchmark for VLLM to Expertize in Spatial Rules

Dec 24, 2024
Via arXiv