Picture for Yiqing Shen

Yiqing Shen

Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations

Add code
Jun 09, 2025
Viaarxiv icon

Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

Reasoning Segmentation for Images and Videos: A Survey

Add code
May 24, 2025
Viaarxiv icon

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Add code
May 22, 2025
Viaarxiv icon

RVTBench: A Benchmark for Visual Reasoning Tasks

Add code
May 17, 2025
Viaarxiv icon

Position: Foundation Models Need Digital Twin Representations

Add code
May 01, 2025
Viaarxiv icon

AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers

Add code
Apr 28, 2025
Viaarxiv icon

Online Reasoning Video Segmentation with Just-in-Time Digital Twins

Add code
Mar 27, 2025
Viaarxiv icon

Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins

Add code
Mar 26, 2025
Viaarxiv icon

TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement

Add code
Mar 11, 2025
Viaarxiv icon