Picture for Xiangyu Kong

Xiangyu Kong

RA-CLAP: Relation-Augmented Emotional Speaking Style Contrastive Language-Audio Pretraining For Speech Retrieval

Add code
May 26, 2025
Viaarxiv icon

REACT 2025: the Third Multiple Appropriate Facial Reaction Generation Challenge

Add code
May 22, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Image Super-Resolution ($\times$4): Methods and Results

Add code
Apr 20, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Image Denoising Challenge Report

Add code
Apr 16, 2025
Viaarxiv icon

Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition

Add code
Aug 01, 2024
Figure 1 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 2 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 3 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 4 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Viaarxiv icon

Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

Add code
Jul 12, 2024
Figure 1 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 2 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 3 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 4 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Viaarxiv icon

Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy

Add code
Jul 09, 2024
Viaarxiv icon

MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

Add code
Jun 11, 2024
Figure 1 for MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results
Figure 2 for MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results
Figure 3 for MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results
Figure 4 for MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results
Viaarxiv icon

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Add code
Jan 19, 2024
Figure 1 for CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Figure 2 for CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Figure 3 for CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Figure 4 for CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Viaarxiv icon

DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation

Add code
Mar 14, 2023
Figure 1 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 2 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 3 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 4 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Viaarxiv icon