Hao Yang

Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models

Apr 22, 2025
Via arXiv

Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends

Apr 21, 2025
Via arXiv

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Apr 19, 2025
Via arXiv

PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks

Apr 12, 2025
Via arXiv

Kimi-VL Technical Report

Apr 10, 2025
Via arXiv

Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement

Apr 08, 2025
Via arXiv

DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation

Apr 07, 2025
Via arXiv

MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX

Mar 27, 2025
Via arXiv

Adaptive Weighted Parameter Fusion with CLIP for Class-Incremental Learning

Mar 25, 2025
Via arXiv

Sensorless Remote Center of Motion Misalignment Estimation

Mar 17, 2025
Via arXiv