Taihao Li

CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models

Dec 20, 2023
Dan Shi, Chaobin You, Jiantao Huang, Taihao Li, Deyi Xiong

Via arXiv

RedCore: Relative Advantage Aware Cross-modal Representation Learning for Missing Modalities with Imbalanced Missing Rates

Dec 16, 2023
Jun Sun, Xinxin Zhang, Shoukang Han, Yu-ping Ruan, Taihao Li

Via arXiv

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

Dec 01, 2023
Fukun Yin, Xin Chen, Chi Zhang, Biao Jiang, Zibo Zhao, Jiayuan Fan, Gang Yu, Taihao Li, Tao Chen

Via arXiv

Frame Pairwise Distance Loss for Weakly-supervised Sound Event Detection

Sep 21, 2023
Rui Tao, Yuxing Huang, Xiangdong Wang, Long Yan, Lufeng Zhai, Kazushige Ouchi, Taihao Li

Via arXiv

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Sep 06, 2023
Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen

Via arXiv

Disentangling Prosody Representations with Unsupervised Speech Reconstruction

Dec 14, 2022
Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter

Via arXiv

Parameter-Efficient Tuning on Layer Normalization for Pre-trained Language Models

Dec 09, 2022
Wang Qi, Yu-Ping Ruan, Yuan Zuo, Taihao Li

Via arXiv

Data Augmentation with Unsupervised Speaking Style Transfer for Speech Emotion Recognition

Nov 16, 2022
Leyuan Qu, Wei Wang, Taihao Li, Cornelius Weber, Stefan Wermter, Fuji Ren

Via arXiv