Picture for Yuqian Fu

Yuqian Fu

OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer

Add code
Mar 15, 2026
Viaarxiv icon

InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing

Add code
Mar 13, 2026
Viaarxiv icon

VisNec: Measuring and Leveraging Visual Necessity for Multimodal Instruction Tuning

Add code
Mar 01, 2026
Viaarxiv icon

EgoSound: Benchmarking Sound Understanding in Egocentric Videos

Add code
Feb 15, 2026
Viaarxiv icon

CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking

Add code
Nov 19, 2025
Viaarxiv icon

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Add code
Oct 29, 2025
Viaarxiv icon

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Add code
Oct 08, 2025
Viaarxiv icon

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Add code
Sep 11, 2025
Viaarxiv icon

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Add code
Jun 24, 2025
Viaarxiv icon

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Add code
Jun 23, 2025
Viaarxiv icon