Picture for Xuxin Cheng

Xuxin Cheng

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation

Add code
Sep 14, 2024
Figure 1 for Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation
Figure 2 for Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation
Figure 3 for Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation
Viaarxiv icon

ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation

Add code
Aug 21, 2024
Figure 1 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 2 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 3 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 4 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Viaarxiv icon

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model

Add code
Aug 18, 2024
Viaarxiv icon

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models

Add code
Aug 06, 2024
Figure 1 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 2 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 3 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 4 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Viaarxiv icon

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

Add code
Jul 24, 2024
Figure 1 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 2 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 3 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 4 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Viaarxiv icon

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Add code
Jul 01, 2024
Figure 1 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 2 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 3 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 4 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Viaarxiv icon

CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction

Add code
Jul 01, 2024
Viaarxiv icon

EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

Add code
Jul 01, 2024
Figure 1 for EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction
Figure 2 for EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction
Figure 3 for EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction
Figure 4 for EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction
Viaarxiv icon

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

Add code
May 31, 2024
Figure 1 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 2 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 3 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 4 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Viaarxiv icon

Uncertainty-aware sign language video retrieval with probability distribution modeling

Add code
May 30, 2024
Viaarxiv icon