Picture for Jinhui Tang

Jinhui Tang

Rebalanced Multimodal Learning with Data-aware Unimodal Sampling

Add code
Mar 05, 2025
Viaarxiv icon

Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model

Add code
Feb 13, 2025
Viaarxiv icon

Vision-centric Token Compression in Large Language Model

Add code
Feb 04, 2025
Figure 1 for Vision-centric Token Compression in Large Language Model
Figure 2 for Vision-centric Token Compression in Large Language Model
Figure 3 for Vision-centric Token Compression in Large Language Model
Figure 4 for Vision-centric Token Compression in Large Language Model
Viaarxiv icon

Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Add code
Jan 03, 2025
Figure 1 for Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
Figure 2 for Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
Figure 3 for Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
Figure 4 for Merging Context Clustering with Visual State Space Models for Medical Image Segmentation
Viaarxiv icon

Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation

Add code
Dec 14, 2024
Figure 1 for Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Figure 2 for Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Figure 3 for Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Figure 4 for Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Viaarxiv icon

ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring

Add code
Dec 12, 2024
Figure 1 for ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring
Figure 2 for ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring
Figure 3 for ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring
Figure 4 for ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring
Viaarxiv icon

Embedding and Enriching Explicit Semantics for Visible-Infrared Person Re-Identification

Add code
Dec 11, 2024
Viaarxiv icon

FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

Add code
Dec 02, 2024
Figure 1 for FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
Figure 2 for FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
Figure 3 for FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
Figure 4 for FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
Viaarxiv icon

Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection

Add code
Dec 02, 2024
Viaarxiv icon

Precision Profile Pollution Attack on Sequential Recommenders via Influence Function

Add code
Dec 02, 2024
Figure 1 for Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
Figure 2 for Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
Figure 3 for Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
Figure 4 for Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
Viaarxiv icon