Jingyu Li

Composed Multi-modal Retrieval: A Survey of Approaches and Applications

Mar 03, 2025
Via arXiv

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Jan 21, 2025
Via arXiv

Probing Speaker-specific Features in Speaker Representations

Jan 09, 2025
Via arXiv

Contrastive Representation for Interactive Recommendation

Dec 24, 2024
Via arXiv

FairSort: Learning to Fair Rank for Personalized Recommendations in Two-Sided Platforms

Nov 30, 2024
Via arXiv

An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems

Nov 18, 2024
Via arXiv

Dual-path Collaborative Generation Network for Emotional Video Captioning

Aug 06, 2024
Via arXiv

DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation

Jun 06, 2024
Via arXiv

Label-efficient Multi-organ Segmentation Method with Diffusion Model

Feb 23, 2024
Via arXiv

Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss

Jan 08, 2024
Via arXiv