Picture for Yiqing Shen

Yiqing Shen

Temporally-Constrained Video Reasoning Segmentation and Automated Benchmark Construction

Add code
Jul 22, 2025
Viaarxiv icon

Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models

Add code
Jun 10, 2025
Viaarxiv icon

Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations

Add code
Jun 09, 2025
Viaarxiv icon

Reasoning Segmentation for Images and Videos: A Survey

Add code
May 24, 2025
Viaarxiv icon

Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Add code
May 22, 2025
Viaarxiv icon

RVTBench: A Benchmark for Visual Reasoning Tasks

Add code
May 17, 2025
Viaarxiv icon

Position: Foundation Models Need Digital Twin Representations

Add code
May 01, 2025
Viaarxiv icon

AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers

Add code
Apr 28, 2025
Figure 1 for AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
Figure 2 for AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
Figure 3 for AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
Figure 4 for AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
Viaarxiv icon

Online Reasoning Video Segmentation with Just-in-Time Digital Twins

Add code
Mar 27, 2025
Viaarxiv icon