Shimin Tao

A method for improving multilingual quality and diversity of instruction fine-tuning datasets

Sep 19, 2025
Via arXiv

CultureScope: A Dimensional Lens for Probing Cultural Understanding in LLMs

Sep 19, 2025
Via arXiv

RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

Sep 18, 2025
Via arXiv

ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction

May 23, 2025
Via arXiv

MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis

May 23, 2025
Via arXiv

Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement

Apr 08, 2025
Via arXiv

Improving LLM-based Document-level Machine Translation with Multi-Knowledge Fusion

Mar 15, 2025
Via arXiv

R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

Feb 27, 2025
Via arXiv

M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models

Dec 24, 2024
Via arXiv

Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge

Dec 02, 2024
Via arXiv