Picture for Yujie Lu

Yujie Lu

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Add code
Jun 16, 2024
Viaarxiv icon

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Add code
Jun 12, 2024
Viaarxiv icon

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Add code
Jun 11, 2024
Viaarxiv icon

From Text to Pixel: Advancing Long-Context Understanding in MLLMs

Add code
May 23, 2024
Viaarxiv icon

Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)

Add code
Apr 05, 2024
Viaarxiv icon

Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

Add code
Mar 03, 2024
Figure 1 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 2 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 3 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Figure 4 for Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
Viaarxiv icon

VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following

Add code
Nov 29, 2023
Figure 1 for VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following
Figure 2 for VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following
Figure 3 for VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following
Figure 4 for VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following
Viaarxiv icon

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Add code
Nov 02, 2023
Figure 1 for GPT-4V as a Generalist Evaluator for Vision-Language Tasks
Figure 2 for GPT-4V as a Generalist Evaluator for Vision-Language Tasks
Figure 3 for GPT-4V as a Generalist Evaluator for Vision-Language Tasks
Figure 4 for GPT-4V as a Generalist Evaluator for Vision-Language Tasks
Viaarxiv icon

ImagenHub: Standardizing the evaluation of conditional image generation models

Add code
Oct 17, 2023
Figure 1 for ImagenHub: Standardizing the evaluation of conditional image generation models
Figure 2 for ImagenHub: Standardizing the evaluation of conditional image generation models
Figure 3 for ImagenHub: Standardizing the evaluation of conditional image generation models
Figure 4 for ImagenHub: Standardizing the evaluation of conditional image generation models
Viaarxiv icon

Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting

Add code
Oct 11, 2023
Figure 1 for Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
Figure 2 for Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
Figure 3 for Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
Figure 4 for Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
Viaarxiv icon