Picture for Lantian Li

Lantian Li

An Investigation on Speaker Augmentation for End-to-End Speaker Extraction

Add code
May 27, 2025
Viaarxiv icon

Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification

Add code
Oct 21, 2024
Figure 1 for Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification
Figure 2 for Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification
Figure 3 for Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification
Viaarxiv icon

AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition

Add code
Oct 21, 2024
Figure 1 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 2 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 3 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 4 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Viaarxiv icon

Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective

Add code
Sep 29, 2024
Figure 1 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 2 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 3 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 4 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Viaarxiv icon

Serialized Output Training by Learned Dominance

Add code
Jul 04, 2024
Viaarxiv icon

CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge

Add code
Jun 14, 2024
Figure 1 for CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge
Figure 2 for CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge
Figure 3 for CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge
Figure 4 for CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge
Viaarxiv icon

SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition

Add code
Jun 12, 2024
Figure 1 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 2 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 3 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 4 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Viaarxiv icon

Zero-Shot Fake Video Detection by Audio-Visual Consistency

Add code
Jun 12, 2024
Figure 1 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 2 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 3 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 4 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Viaarxiv icon

A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

Add code
Jun 11, 2024
Figure 1 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 2 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 3 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 4 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Viaarxiv icon

Adversarial Data Augmentation for Robust Speaker Verification

Add code
Feb 05, 2024
Viaarxiv icon