Picture for Weidi Xie

Weidi Xie

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

Add code
May 22, 2025
Viaarxiv icon

Multi-Agent System for Comprehensive Soccer Understanding

Add code
May 06, 2025
Viaarxiv icon

ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification

Add code
Apr 29, 2025
Viaarxiv icon

Learning Streaming Video Representation via Multitask Training

Add code
Apr 28, 2025
Viaarxiv icon

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

Add code
Apr 16, 2025
Viaarxiv icon

Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation

Add code
Apr 01, 2025
Viaarxiv icon

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Add code
Mar 06, 2025
Figure 1 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Figure 2 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Viaarxiv icon

RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining

Add code
Mar 06, 2025
Viaarxiv icon

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Add code
Mar 02, 2025
Viaarxiv icon

M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging

Add code
Feb 27, 2025
Viaarxiv icon