Alert button
Picture for Zhou Zhao

Zhou Zhao

Alert button

Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models

Add code
Bookmark button
Alert button
Oct 15, 2023
Zijian Zhang, Luping Liu. Zhijie Lin, Yichen Zhu, Zhou Zhao

Viaarxiv icon

Extending Multi-modal Contrastive Representations

Add code
Bookmark button
Alert button
Oct 13, 2023
Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao

Figure 1 for Extending Multi-modal Contrastive Representations
Figure 2 for Extending Multi-modal Contrastive Representations
Figure 3 for Extending Multi-modal Contrastive Representations
Figure 4 for Extending Multi-modal Contrastive Representations
Viaarxiv icon

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Add code
Bookmark button
Alert button
Oct 11, 2023
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng

Figure 1 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 2 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 3 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 4 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Viaarxiv icon

Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer

Add code
Bookmark button
Alert button
Sep 14, 2023
Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao

Figure 1 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 2 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 3 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 4 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Viaarxiv icon

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models

Add code
Bookmark button
Alert button
Aug 28, 2023
Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

Figure 1 for TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
Figure 2 for TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
Figure 3 for TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
Figure 4 for TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
Viaarxiv icon

Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes

Add code
Bookmark button
Alert button
Aug 17, 2023
Zehan Wang, Haifeng Huang, Yang Zhao, Ziang Zhang, Zhou Zhao

Figure 1 for Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Figure 2 for Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Figure 3 for Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Figure 4 for Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Viaarxiv icon

3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding

Add code
Bookmark button
Alert button
Jul 25, 2023
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao

Figure 1 for 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Figure 2 for 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Figure 3 for 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Figure 4 for 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Viaarxiv icon

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Add code
Bookmark button
Alert button
Jul 19, 2023
Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu

Figure 1 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 2 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 3 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 4 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Viaarxiv icon

Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding

Add code
Bookmark button
Alert button
Jul 18, 2023
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao

Figure 1 for Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
Figure 2 for Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
Figure 3 for Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
Figure 4 for Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
Viaarxiv icon

Gloss Attention for Gloss-free Sign Language Translation

Add code
Bookmark button
Alert button
Jul 14, 2023
Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao

Figure 1 for Gloss Attention for Gloss-free Sign Language Translation
Figure 2 for Gloss Attention for Gloss-free Sign Language Translation
Figure 3 for Gloss Attention for Gloss-free Sign Language Translation
Figure 4 for Gloss Attention for Gloss-free Sign Language Translation
Viaarxiv icon