Picture for Yu Zhao

Yu Zhao

Technical Report of TeleChat2, TeleChat2.5 and T1

Add code
Jul 24, 2025
Viaarxiv icon

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models

Add code
Jul 16, 2025
Viaarxiv icon

Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Add code
Jun 10, 2025
Viaarxiv icon

Chain-of-Lure: A Synthetic Narrative-Driven Approach to Compromise Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Table-R1: Region-based Reinforcement Learning for Table Understanding

Add code
May 18, 2025
Viaarxiv icon

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

Add code
May 17, 2025
Viaarxiv icon

MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence

Add code
May 15, 2025
Viaarxiv icon

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Add code
May 15, 2025
Viaarxiv icon

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Add code
Apr 22, 2025
Viaarxiv icon

LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs

Add code
Apr 11, 2025
Viaarxiv icon