Hailin Jin

MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval

Jun 25, 2024
Via arXiv

Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval

Jan 29, 2024
Via arXiv

Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory

Sep 07, 2023
Via arXiv

Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models

Sep 01, 2023
Via arXiv

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training

Feb 28, 2023
Via arXiv

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos

Oct 12, 2022
Via arXiv

Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment

Oct 10, 2022
Via arXiv

Video Activity Localisation with Uncertainties in Temporal Boundary

Jun 26, 2022
Via arXiv

MHMS: Multimodal Hierarchical Multimedia Summarization

Apr 07, 2022
Via arXiv

StyleBabel: Artistic Style Tagging and Captioning

Mar 11, 2022
Via arXiv