Qin Jin

Renmin University of China

What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation

Aug 26, 2024

UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos

Jun 24, 2024

QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds

Jun 24, 2024

SingOMD: Singing Oriented Multi-resolution Discrete Representation Construction from Speech Models

Jun 20, 2024

SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction

Jun 16, 2024

ESCoT: Towards Interpretable Emotional Support Dialogue Systems

Jun 16, 2024

Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition

Jun 13, 2024

TokSing: Singing Voice Synthesis based on Discrete Tokens

Jun 12, 2024

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

Jun 11, 2024

EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?

May 28, 2024