Yanfeng Wang

Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, China

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

May 22, 2025
Via arXiv

VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models

May 21, 2025
Via arXiv

AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation

May 17, 2025
Via arXiv

Controllable Image Colorization with Instance-aware Texts and Masks

May 13, 2025
Via arXiv

Multi-Agent System for Comprehensive Soccer Understanding

May 06, 2025
Via arXiv

Incentivizing Inclusive Contributions in Model Sharing Markets

May 05, 2025
Via arXiv

ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification

Apr 29, 2025
Via arXiv

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Apr 29, 2025
Via arXiv

VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

Apr 05, 2025
Via arXiv

COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking

Apr 02, 2025
Via arXiv