Yujie Lu

SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation

Feb 22, 2026 (via arXiv)

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026 (via arXiv)

TongSIM: A General Platform for Simulating Intelligent Machines

Dec 23, 2025 (via arXiv)

S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test

Dec 23, 2025 (via arXiv)

MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research

May 26, 2025 (via arXiv)

VITED: Video Temporal Evidence Distillation

Mar 17, 2025 (via arXiv)

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Jun 16, 2024 (via arXiv)

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Jun 12, 2024 (via arXiv)

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Jun 11, 2024 (via arXiv)

From Text to Pixel: Advancing Long-Context Understanding in MLLMs

May 23, 2024 (via arXiv)