Picture for Sheng Zhou

Sheng Zhou

Zhejiang University

Towards Scalable Web Accessibility Audit with MLLMs as Copilots

Add code
Nov 05, 2025
Viaarxiv icon

FedTeddi: Temporal Drift and Divergence Aware Scheduling for Timely Federated Edge Learning

Add code
Sep 09, 2025
Viaarxiv icon

Advancing Loss Functions in Recommender Systems: A Comparative Study with a Rényi Divergence-Based Solution

Add code
Jun 18, 2025
Viaarxiv icon

GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies

Add code
Jun 17, 2025
Figure 1 for GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Figure 2 for GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Figure 3 for GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Figure 4 for GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Viaarxiv icon

FedCGD: Collective Gradient Divergence Optimized Scheduling for Wireless Federated Learning

Add code
Jun 09, 2025
Viaarxiv icon

Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification

Add code
Jun 08, 2025
Viaarxiv icon

OpenGT: A Comprehensive Benchmark For Graph Transformers

Add code
Jun 05, 2025
Viaarxiv icon

Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models

Add code
May 26, 2025
Figure 1 for Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models
Figure 2 for Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models
Figure 3 for Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models
Figure 4 for Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models
Viaarxiv icon

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

Add code
May 24, 2025
Figure 1 for Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Figure 2 for Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Figure 3 for Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Figure 4 for Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Viaarxiv icon

FocusedAD: Character-centric Movie Audio Description

Add code
Apr 16, 2025
Figure 1 for FocusedAD: Character-centric Movie Audio Description
Figure 2 for FocusedAD: Character-centric Movie Audio Description
Figure 3 for FocusedAD: Character-centric Movie Audio Description
Figure 4 for FocusedAD: Character-centric Movie Audio Description
Viaarxiv icon