Picture for Yujie Lu

Yujie Lu

S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test

Add code
Dec 23, 2025
Figure 1 for S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test
Figure 2 for S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test
Figure 3 for S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test
Figure 4 for S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test
Viaarxiv icon

TongSIM: A General Platform for Simulating Intelligent Machines

Add code
Dec 23, 2025
Viaarxiv icon

MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research

Add code
May 26, 2025
Viaarxiv icon

VITED: Video Temporal Evidence Distillation

Add code
Mar 17, 2025
Viaarxiv icon

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Add code
Jun 16, 2024
Viaarxiv icon

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Add code
Jun 12, 2024
Figure 1 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 2 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 3 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 4 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Viaarxiv icon

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Add code
Jun 11, 2024
Figure 1 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 2 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 3 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 4 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Viaarxiv icon

From Text to Pixel: Advancing Long-Context Understanding in MLLMs

Add code
May 23, 2024
Figure 1 for From Text to Pixel: Advancing Long-Context Understanding in MLLMs
Figure 2 for From Text to Pixel: Advancing Long-Context Understanding in MLLMs
Figure 3 for From Text to Pixel: Advancing Long-Context Understanding in MLLMs
Figure 4 for From Text to Pixel: Advancing Long-Context Understanding in MLLMs
Viaarxiv icon

Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)

Add code
Apr 05, 2024
Figure 1 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 2 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 3 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 4 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Viaarxiv icon

Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

Add code
Mar 03, 2024
Figure 1 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 2 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 3 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 4 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Viaarxiv icon