Picture for Pinxin Liu

Pinxin Liu

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Add code
May 26, 2025
Viaarxiv icon

$I^2G$: Generating Instructional Illustrations via Text-Conditioned Diffusion

Add code
May 22, 2025
Viaarxiv icon

Intentional Gesture: Deliver Your Intentions with Gestures for Speech

Add code
May 21, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Add code
Apr 14, 2025
Viaarxiv icon

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Add code
Apr 09, 2025
Viaarxiv icon

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)

Add code
Apr 04, 2025
Viaarxiv icon

Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation

Add code
Feb 11, 2025
Viaarxiv icon

GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling

Add code
Jan 31, 2025
Viaarxiv icon

Generative AI for Cel-Animation: A Survey

Add code
Jan 08, 2025
Viaarxiv icon

KinMo: Kinematic-aware Human Motion Understanding and Generation

Add code
Nov 23, 2024
Viaarxiv icon