Picture for Wei Sun

Wei Sun

Max

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

Add code
Feb 04, 2025
Viaarxiv icon

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment

Add code
Jan 30, 2025
Figure 1 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 2 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 3 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Figure 4 for AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
Viaarxiv icon

Do We Really Need to Design New Byzantine-robust Aggregation Rules?

Add code
Jan 29, 2025
Viaarxiv icon

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Add code
Jan 23, 2025
Viaarxiv icon

Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation

Add code
Nov 29, 2024
Viaarxiv icon

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

Add code
Nov 25, 2024
Figure 1 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 2 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 3 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 4 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Viaarxiv icon

VQA$^2$:Visual Question Answering for Video Quality Assessment

Add code
Nov 06, 2024
Figure 1 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 2 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 3 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 4 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Viaarxiv icon

MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator

Add code
Oct 14, 2024
Viaarxiv icon

MOLA: Enhancing Industrial Process Monitoring Using Multi-Block Orthogonal Long Short-Term Memory Autoencoder

Add code
Oct 10, 2024
Viaarxiv icon

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Add code
Oct 07, 2024
Figure 1 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 2 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 3 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 4 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Viaarxiv icon