Alert button
Picture for Ming Cheng

Ming Cheng

Alert button

VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning

Add code
Bookmark button
Alert button
Apr 11, 2024
Ming Cheng, Bowen Zhang, Ziyu Wang, Ziyi Zhou, Weiqi Feng, Yi Lyu, Xingjian Diao

Viaarxiv icon

Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds

Add code
Bookmark button
Alert button
Mar 27, 2024
Zhimin Yuan, Wankang Zeng, Yanfei Su, Weiquan Liu, Ming Cheng, Yulan Guo, Cheng Wang

Viaarxiv icon

STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow

Add code
Bookmark button
Alert button
Mar 11, 2024
Zhiyang Lu, Qinghan Chen, Ming Cheng

Figure 1 for STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
Figure 2 for STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
Figure 3 for STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
Figure 4 for STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow
Viaarxiv icon

Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer

Add code
Bookmark button
Alert button
Mar 04, 2024
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li

Figure 1 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 2 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 3 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 4 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Viaarxiv icon

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

Add code
Bookmark button
Alert button
Jan 16, 2024
Ming Cheng, Ming Li

Viaarxiv icon

SAIC: Integration of Speech Anonymization and Identity Classification

Add code
Bookmark button
Alert button
Dec 23, 2023
Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu

Viaarxiv icon

FT2TF: First-Person Statement Text-To-Talking Face Generation

Add code
Bookmark button
Alert button
Dec 09, 2023
Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin

Figure 1 for FT2TF: First-Person Statement Text-To-Talking Face Generation
Figure 2 for FT2TF: First-Person Statement Text-To-Talking Face Generation
Figure 3 for FT2TF: First-Person Statement Text-To-Talking Face Generation
Figure 4 for FT2TF: First-Person Statement Text-To-Talking Face Generation
Viaarxiv icon

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

Add code
Bookmark button
Alert button
Sep 15, 2023
Xingjian Diao, Ming Cheng, Shitong Cheng

Figure 1 for AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder
Figure 2 for AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder
Figure 3 for AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder
Figure 4 for AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder
Viaarxiv icon

VoxBlink: X-Large Speaker Verification Dataset on Camera

Add code
Bookmark button
Alert button
Aug 23, 2023
Yuke Lin, Xiaoyi Qin, Ming Cheng, Ning Jiang, Guoqing Zhao, Ming Li

Figure 1 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 2 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 3 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 4 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Viaarxiv icon

Masked Cross-image Encoding for Few-shot Segmentation

Add code
Bookmark button
Alert button
Aug 22, 2023
Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Figure 1 for Masked Cross-image Encoding for Few-shot Segmentation
Figure 2 for Masked Cross-image Encoding for Few-shot Segmentation
Figure 3 for Masked Cross-image Encoding for Few-shot Segmentation
Figure 4 for Masked Cross-image Encoding for Few-shot Segmentation
Viaarxiv icon