Alert button
Picture for Zhijie Yan

Zhijie Yan

Alert button

TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

Add code
Bookmark button
Alert button
Mar 28, 2024
Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao

Figure 1 for TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Figure 2 for TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Figure 3 for TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Figure 4 for TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Viaarxiv icon

Large Language Models Powered Context-aware Motion Prediction

Add code
Bookmark button
Alert button
Mar 17, 2024
Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong

Figure 1 for Large Language Models Powered Context-aware Motion Prediction
Figure 2 for Large Language Models Powered Context-aware Motion Prediction
Figure 3 for Large Language Models Powered Context-aware Motion Prediction
Figure 4 for Large Language Models Powered Context-aware Motion Prediction
Viaarxiv icon

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures

Add code
Bookmark button
Alert button
Dec 19, 2023
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan

Viaarxiv icon

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Add code
Bookmark button
Alert button
Nov 14, 2023
Yunfei Chu, Jin Xu, Xiaohuan Zhou, Qian Yang, Shiliang Zhang, Zhijie Yan, Chang Zhou, Jingren Zhou

Viaarxiv icon

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

Add code
Bookmark button
Alert button
Oct 11, 2023
Jiaming Wang, Zhihao Du, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang

Figure 1 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 2 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 3 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 4 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Viaarxiv icon

The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR

Add code
Bookmark button
Alert button
Sep 24, 2023
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu

Figure 1 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 2 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 3 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 4 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Viaarxiv icon

Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

Add code
Bookmark button
Alert button
May 25, 2023
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan

Figure 1 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 2 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 3 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 4 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Viaarxiv icon

MUG: A General Meeting Understanding and Generation Benchmark

Add code
Bookmark button
Alert button
Mar 27, 2023
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao

Figure 1 for MUG: A General Meeting Understanding and Generation Benchmark
Figure 2 for MUG: A General Meeting Understanding and Generation Benchmark
Figure 3 for MUG: A General Meeting Understanding and Generation Benchmark
Figure 4 for MUG: A General Meeting Understanding and Generation Benchmark
Viaarxiv icon

Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)

Add code
Bookmark button
Alert button
Mar 24, 2023
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao

Figure 1 for Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)
Figure 2 for Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)
Viaarxiv icon