Picture for Shun Qian

Shun Qian

TextlessRAG: End-to-End Visual Document RAG by Speech Without Text

Add code
Sep 10, 2025
Viaarxiv icon

Spatial-Aware Efficient Projector for MLLMs via Multi-Layer Feature Aggregation

Add code
Oct 14, 2024
Viaarxiv icon