Picture for Xuxin Cheng

Xuxin Cheng

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model

Add code
Aug 18, 2024
Viaarxiv icon

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models

Add code
Aug 06, 2024
Figure 1 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 2 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 3 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 4 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Viaarxiv icon

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

Add code
Jul 24, 2024
Figure 1 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 2 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 3 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 4 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Viaarxiv icon

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Add code
Jul 01, 2024
Figure 1 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 2 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 3 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 4 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Viaarxiv icon

CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction

Add code
Jul 01, 2024
Viaarxiv icon

EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

Add code
Jul 01, 2024
Viaarxiv icon

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

Add code
May 31, 2024
Figure 1 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 2 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 3 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Figure 4 for Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Viaarxiv icon

Uncertainty-aware sign language video retrieval with probability distribution modeling

Add code
May 30, 2024
Viaarxiv icon

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation

Add code
May 17, 2024
Viaarxiv icon

WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD

Add code
May 03, 2024
Figure 1 for WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD
Figure 2 for WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD
Figure 3 for WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD
Viaarxiv icon