Picture for Yu Zhao

Yu Zhao

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Add code
Aug 27, 2025
Viaarxiv icon

Technical Report of TeleChat2, TeleChat2.5 and T1

Add code
Jul 24, 2025
Viaarxiv icon

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models

Add code
Jul 16, 2025
Viaarxiv icon

Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Add code
Jun 10, 2025
Viaarxiv icon

Chain-of-Lure: A Synthetic Narrative-Driven Approach to Compromise Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Table-R1: Region-based Reinforcement Learning for Table Understanding

Add code
May 18, 2025
Viaarxiv icon

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

Add code
May 17, 2025
Viaarxiv icon

MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence

Add code
May 15, 2025
Viaarxiv icon

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Add code
May 15, 2025
Viaarxiv icon

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Add code
Apr 22, 2025
Viaarxiv icon