Picture for Pan Zhou

Pan Zhou

The Hubei Engineering Research Center on Big Data Security, School of Cyber Science and Engineering, Huazhong University of Science and Technology

Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior

Add code
Jan 17, 2024
Viaarxiv icon

MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

Add code
Jan 07, 2024
Viaarxiv icon

The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

Add code
Jan 07, 2024
Figure 1 for The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023
Figure 2 for The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Figure 1 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Figure 2 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Viaarxiv icon

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Add code
Dec 15, 2023
Viaarxiv icon

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

Add code
Dec 15, 2023
Viaarxiv icon

Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks

Add code
Dec 14, 2023
Figure 1 for Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks
Figure 2 for Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks
Figure 3 for Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks
Figure 4 for Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks
Viaarxiv icon

Genixer: Empowering Multimodal Large Language Models as a Powerful Data Generator

Add code
Dec 11, 2023
Viaarxiv icon

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

Add code
Dec 06, 2023
Figure 1 for Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Figure 2 for Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Figure 3 for Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Figure 4 for Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Viaarxiv icon

Exploring the Robustness of Decentralized Training for Large Language Models

Add code
Dec 01, 2023
Viaarxiv icon