Pengcheng Li

EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition

Sep 19, 2025
Via arXiv

Predicting Artificial Neural Network Representations to Learn Recognition Model for Music Identification from Brain Recordings

Dec 20, 2024
Via arXiv

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Sep 29, 2024
Via arXiv

TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings

Sep 15, 2024
Via arXiv

IVGF: The Fusion-Guided Infrared and Visible General Framework

Sep 02, 2024
Via arXiv

Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation

Aug 21, 2024
Via arXiv

A Point-Neighborhood Learning Framework for Nasal Endoscope Image Segmentation

May 30, 2024
Via arXiv

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

May 02, 2024
Via arXiv

CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition

Apr 30, 2024
Via arXiv

Medical Speech Symptoms Classification via Disentangled Representation

Mar 08, 2024
Via arXiv