Picture for Yuanbo Fang

Yuanbo Fang

S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models

Add code
May 20, 2025
Figure 1 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 2 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 3 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 4 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Viaarxiv icon

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Add code
Feb 24, 2025
Figure 1 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 2 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 3 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 4 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Viaarxiv icon

Baichuan-Omni-1.5 Technical Report

Add code
Jan 26, 2025
Figure 1 for Baichuan-Omni-1.5 Technical Report
Figure 2 for Baichuan-Omni-1.5 Technical Report
Figure 3 for Baichuan-Omni-1.5 Technical Report
Figure 4 for Baichuan-Omni-1.5 Technical Report
Viaarxiv icon

Multi-Scale Temporal Transformer For Speech Emotion Recognition

Add code
Oct 01, 2024
Figure 1 for Multi-Scale Temporal Transformer For Speech Emotion Recognition
Figure 2 for Multi-Scale Temporal Transformer For Speech Emotion Recognition
Figure 3 for Multi-Scale Temporal Transformer For Speech Emotion Recognition
Figure 4 for Multi-Scale Temporal Transformer For Speech Emotion Recognition
Viaarxiv icon