Picture for Dongfu Jiang

Dongfu Jiang

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Add code
May 26, 2025
Viaarxiv icon

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Add code
May 22, 2025
Viaarxiv icon

General-Reasoner: Advancing LLM Reasoning Across All Domains

Add code
May 21, 2025
Viaarxiv icon

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Add code
Feb 03, 2025
Viaarxiv icon

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Add code
Oct 14, 2024
Figure 1 for MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Figure 2 for MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Figure 3 for MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Figure 4 for MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Viaarxiv icon

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Add code
Jun 24, 2024
Figure 1 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 2 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 3 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 4 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Viaarxiv icon

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Add code
Jun 16, 2024
Viaarxiv icon

GenAI Arena: An Open Evaluation Platform for Generative Models

Add code
Jun 06, 2024
Viaarxiv icon

MANTIS: Interleaved Multi-Image Instruction Tuning

Add code
May 02, 2024
Figure 1 for MANTIS: Interleaved Multi-Image Instruction Tuning
Figure 2 for MANTIS: Interleaved Multi-Image Instruction Tuning
Figure 3 for MANTIS: Interleaved Multi-Image Instruction Tuning
Figure 4 for MANTIS: Interleaved Multi-Image Instruction Tuning
Viaarxiv icon

VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation

Add code
Dec 22, 2023
Figure 1 for VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Figure 2 for VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Figure 3 for VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Figure 4 for VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Viaarxiv icon