Tao Jin

University of Science and Technology of China

TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

Dec 23, 2023
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao

Via arXiv

Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers

Dec 15, 2023
Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao

Via arXiv

Extending Multi-modal Contrastive Representations

Oct 13, 2023
Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao

Via arXiv

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

Oct 02, 2023
Qiwei Di, Tao Jin, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu

Via arXiv

A Remote Sim2real Aerial Competition: Fostering Reproducibility and Solutions' Diversity in Robotics Challenges

Aug 31, 2023
Spencer Teetaert, Wenda Zhao, Niu Xinyuan, Hashir Zahir, Huiyu Leong, Michel Hidalgo, Gerardo Puga, Tomas Lorente, Nahuel Espinosa, John Alejandro Duarte Carrasco, Kaizheng Zhang, Jian Di, Tao Jin, Xiaohan Li, Yijia Zhou, Xiuhua Liang, Chenxu Zhang, Antonio Loquercio, Siqi Zhou, Lukas Brunke, Melissa Greeff, Wolfgang Hoenig, Jacopo Panerati, Angela P. Schoellig

Via arXiv

Gloss Attention for Gloss-free Sign Language Translation

Jul 14, 2023
Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao

Via arXiv

OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment

Jun 10, 2023
Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao

Via arXiv

DATE: Domain Adaptive Product Seeker for E-commerce

Apr 07, 2023
Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao

Via arXiv

Borda Regret Minimization for Generalized Linear Dueling Bandits

Mar 15, 2023
Yue Wu, Tao Jin, Hao Lou, Farzad Farnoud, Quanquan Gu

Via arXiv

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

Mar 09, 2023
Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao

Via arXiv