Xiao Zhang

KnowledgeBerg: Evaluating Systematic Knowledge Coverage and Compositional Reasoning in Large Language Models

Apr 19, 2026
Via arXiv

CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas

Apr 16, 2026
Via arXiv

WaveMoE: A Wavelet-Enhanced Mixture-of-Experts Foundation Model for Time Series Forecasting

Apr 12, 2026
Via arXiv

Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

Apr 10, 2026
Via arXiv

DetailVerifyBench: A Benchmark for Dense Hallucination Localization in Long Image Captions

Apr 07, 2026
Via arXiv

Learning from Many and Adapting to the Unknown in Open-set Test Streams

Apr 01, 2026
Via arXiv

Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement

Mar 30, 2026
Via arXiv

HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization

Mar 17, 2026
Via arXiv

Bringing Model Editing to Generative Recommendation in Cold-Start Scenarios

Mar 15, 2026
Via arXiv

HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios

Mar 12, 2026
Via arXiv