Picture for Zhaoye Fei

Zhaoye Fei

WESR: Scaling and Evaluating Word-level Event-Speech Recognition

Add code
Jan 08, 2026
Viaarxiv icon

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Add code
Jan 08, 2026
Viaarxiv icon

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Add code
Oct 02, 2025
Figure 1 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 2 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 3 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 4 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Viaarxiv icon

CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation

Add code
Aug 28, 2025
Figure 1 for CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation
Figure 2 for CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation
Figure 3 for CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation
Figure 4 for CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation
Viaarxiv icon

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Add code
Apr 12, 2025
Viaarxiv icon

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Add code
Mar 13, 2025
Viaarxiv icon

How to Mitigate Overfitting in Weak-to-strong Generalization?

Add code
Mar 06, 2025
Figure 1 for How to Mitigate Overfitting in Weak-to-strong Generalization?
Figure 2 for How to Mitigate Overfitting in Weak-to-strong Generalization?
Figure 3 for How to Mitigate Overfitting in Weak-to-strong Generalization?
Figure 4 for How to Mitigate Overfitting in Weak-to-strong Generalization?
Viaarxiv icon

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Add code
Dec 24, 2024
Figure 1 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 2 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 3 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 4 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Add code
Mar 12, 2024
Figure 1 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 2 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 3 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 4 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Viaarxiv icon