Kaifeng Gao

ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models

Jun 16, 2024

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

Jul 30, 2023

Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection

Feb 01, 2023

Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives

Apr 25, 2022

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

Dec 08, 2021

Video Relation Detection via Tracklet based Visual Transformer

Aug 19, 2021

Julia Language in Machine Learning: Algorithms, Applications, and Open Issues

Mar 23, 2020