Fei Zhao

Research on Audio-Visual Quality Assessment Dataset and Method for User-Generated Omnidirectional Video

Jun 12, 2025
Via arXiv

Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model

Jun 05, 2025
Via arXiv

Room Impulse Response as a Prompt for Acoustic Echo Cancellation

May 26, 2025
Via arXiv

Multi-Channel Acoustic Echo Cancellation Based on Direction-of-Arrival Estimation

May 26, 2025
Via arXiv

CompBench: Benchmarking Complex Instruction-guided Image Editing

May 18, 2025
Via arXiv

MarkMatch: Same-Hand Stuffing Detection

May 11, 2025
Via arXiv

Mitigating Image Captioning Hallucinations in Vision-Language Models

May 06, 2025
Via arXiv

GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets

Apr 28, 2025
Via arXiv

Redefining Machine Translation on Social Network Services with Large Language Models

Apr 10, 2025
Via arXiv

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Mar 11, 2025
Via arXiv