Picture for An-Lan Wang

An-Lan Wang

TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

Add code
Feb 26, 2026
Viaarxiv icon

Advancing Sequential Numerical Prediction in Autoregressive Models

Add code
May 19, 2025
Viaarxiv icon

WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?

Add code
May 16, 2025
Viaarxiv icon

Task-Oriented 6-DoF Grasp Pose Detection in Clutters

Add code
Feb 24, 2025
Figure 1 for Task-Oriented 6-DoF Grasp Pose Detection in Clutters
Figure 2 for Task-Oriented 6-DoF Grasp Pose Detection in Clutters
Figure 3 for Task-Oriented 6-DoF Grasp Pose Detection in Clutters
Figure 4 for Task-Oriented 6-DoF Grasp Pose Detection in Clutters
Viaarxiv icon

TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching

Add code
Nov 26, 2024
Figure 1 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 2 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 3 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 4 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Viaarxiv icon

MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark

Add code
Oct 15, 2024
Viaarxiv icon

ParGo: Bridging Vision-Language with Partial and Global Views

Add code
Aug 23, 2024
Viaarxiv icon

EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

Add code
Jun 13, 2024
Viaarxiv icon

Event-Guided Procedure Planning from Instructional Videos with Text Supervision

Add code
Aug 17, 2023
Viaarxiv icon