Picture for Wang Lin

Wang Lin

EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

Add code
Jun 20, 2024
Figure 1 for EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration
Figure 2 for EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration
Figure 3 for EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration
Figure 4 for EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration
Viaarxiv icon

Non-confusing Generation of Customized Concepts in Diffusion Models

Add code
May 11, 2024
Figure 1 for Non-confusing Generation of Customized Concepts in Diffusion Models
Figure 2 for Non-confusing Generation of Customized Concepts in Diffusion Models
Figure 3 for Non-confusing Generation of Customized Concepts in Diffusion Models
Figure 4 for Non-confusing Generation of Customized Concepts in Diffusion Models
Viaarxiv icon

OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment

Add code
Jun 10, 2023
Figure 1 for OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
Figure 2 for OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
Figure 3 for OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
Figure 4 for OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
Viaarxiv icon

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

Add code
Mar 09, 2023
Figure 1 for MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
Figure 2 for MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
Figure 3 for MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
Figure 4 for MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
Viaarxiv icon