Picture for Qin Jin

Qin Jin

Renmin University of China

Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model

Add code
Aug 11, 2025
Viaarxiv icon

POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering

Add code
Jul 16, 2025
Viaarxiv icon

A Survey of Deep Learning for Geometry Problem Solving

Add code
Jul 16, 2025
Viaarxiv icon

IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems

Add code
Jun 06, 2025
Viaarxiv icon

RTime-QA: A Benchmark for Atomic Temporal Event Understanding in Large Multi-modal Models

Add code
May 25, 2025
Viaarxiv icon

EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining

Add code
Mar 19, 2025
Figure 1 for EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Figure 2 for EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Figure 3 for EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Figure 4 for EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Viaarxiv icon

TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM

Add code
Mar 17, 2025
Viaarxiv icon

WritingBench: A Comprehensive Benchmark for Generative Writing

Add code
Mar 07, 2025
Viaarxiv icon

Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval

Add code
Dec 26, 2024
Figure 1 for Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval
Figure 2 for Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval
Figure 3 for Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval
Figure 4 for Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval
Viaarxiv icon

Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models

Add code
Oct 04, 2024
Figure 1 for Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models
Figure 2 for Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models
Figure 3 for Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models
Figure 4 for Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models
Viaarxiv icon