Picture for Wei Sun

Wei Sun

Max

CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting

Add code
Apr 16, 2025
Viaarxiv icon

MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model

Add code
Apr 14, 2025
Viaarxiv icon

FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment

Add code
Apr 12, 2025
Viaarxiv icon

Novel Object 6D Pose Estimation with a Single Reference View

Add code
Mar 07, 2025
Viaarxiv icon

Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content

Add code
Mar 05, 2025
Viaarxiv icon

An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning

Add code
Mar 04, 2025
Viaarxiv icon

Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving

Add code
Feb 11, 2025
Viaarxiv icon

CT-UIO: Continuous-Time UWB-Inertial-Odometer Localization Using Non-Uniform B-spline with Fewer Anchors

Add code
Feb 10, 2025
Viaarxiv icon

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

Add code
Feb 04, 2025
Viaarxiv icon

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment

Add code
Jan 30, 2025
Figure 1 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 2 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 3 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 4 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Viaarxiv icon