Picture for Kaiyuan Liu

Kaiyuan Liu

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion

Add code
Feb 25, 2026
Viaarxiv icon

Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Add code
Jan 14, 2026
Viaarxiv icon

MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus

Add code
Jan 14, 2026
Viaarxiv icon

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Add code
Dec 24, 2025
Viaarxiv icon

FrontierCS: Evolving Challenges for Evolving Intelligence

Add code
Dec 17, 2025
Figure 1 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 2 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 3 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 4 for FrontierCS: Evolving Challenges for Evolving Intelligence
Viaarxiv icon

CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation

Add code
Aug 10, 2025
Viaarxiv icon

Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models

Add code
Jun 14, 2025
Figure 1 for Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
Figure 2 for Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
Figure 3 for Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
Figure 4 for Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
Viaarxiv icon

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Add code
Jun 13, 2025
Viaarxiv icon

GeoCAD: Local Geometry-Controllable CAD Generation

Add code
Jun 12, 2025
Figure 1 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 2 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 3 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 4 for GeoCAD: Local Geometry-Controllable CAD Generation
Viaarxiv icon

ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation

Add code
Mar 10, 2025
Figure 1 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 2 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 3 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 4 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Viaarxiv icon