Picture for Shimin Tao

Shimin Tao

Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection

Add code
May 28, 2026
Viaarxiv icon

The GaoYao Benchmark: A Comprehensive Framework for Evaluating Multilingual and Multicultural Abilities of Large Language Models

Add code
Apr 22, 2026
Viaarxiv icon

Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation

Add code
Mar 26, 2026
Viaarxiv icon

Chart Specification: Structural Representations for Incentivizing VLM Reasoning in Chart-to-Code Generation

Add code
Feb 11, 2026
Viaarxiv icon

A method for improving multilingual quality and diversity of instruction fine-tuning datasets

Add code
Sep 19, 2025
Viaarxiv icon

CultureScope: A Dimensional Lens for Probing Cultural Understanding in LLMs

Add code
Sep 19, 2025
Viaarxiv icon

RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

Add code
Sep 18, 2025
Viaarxiv icon

MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis

Add code
May 23, 2025
Figure 1 for MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis
Figure 2 for MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis
Figure 3 for MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis
Figure 4 for MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis
Viaarxiv icon

ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction

Add code
May 23, 2025
Viaarxiv icon

Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement

Add code
Apr 08, 2025
Viaarxiv icon