Chenghao Xiao

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Jun 11, 2025
Via arXiv

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Jun 08, 2025
Via arXiv

Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts

Apr 29, 2025
Via arXiv

Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Apr 18, 2025
Via arXiv

MIEB: Massive Image Embedding Benchmark

Apr 14, 2025
Via arXiv

MMTEB: Massive Multilingual Text Embedding Benchmark

Feb 19, 2025
Via arXiv

Everything is a Video: Unifying Modalities through Next-Frame Prediction

Nov 15, 2024
Via arXiv

CAST: Corpus-Aware Self-similarity Enhanced Topic modelling

Oct 19, 2024
Via arXiv

On the Rigour of Scientific Writing: Criteria, Analysis, and Insights

Oct 07, 2024
Via arXiv

BioMNER: A Dataset for Biomedical Method Entity Recognition

Jun 28, 2024
Via arXiv