Yu Zhou

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China; Fanyu AI Laboratory, Zhongke Fanyu Technology Co., Ltd, Beijing, China

CROP: Contextual Region-Oriented Visual Token Pruning

May 27, 2025

RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images

May 27, 2025

CODE-DITING: A Reasoning-Based Metric for Functional Alignment in Code Evaluation

May 26, 2025

The Role of Video Generation in Enhancing Data-Limited Action Understanding

May 26, 2025

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

May 22, 2025

The Devil is in Fine-tuning and Long-tailed Problems: A New Benchmark for Scene Text Detection

May 21, 2025

FedRE: Robust and Effective Federated Learning with Privacy Preference

May 08, 2025

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Apr 30, 2025

MTCSC: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction

Apr 26, 2025

StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation

Apr 22, 2025