Tingyu Song

Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers

Jul 08, 2025

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

May 29, 2025

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Mar 06, 2025