Picture for Jingkuan Song

Jingkuan Song

Prompting for Multi-Modal Tracking

Add code
Aug 01, 2022
Figure 1 for Prompting for Multi-Modal Tracking
Figure 2 for Prompting for Multi-Modal Tracking
Figure 3 for Prompting for Multi-Modal Tracking
Figure 4 for Prompting for Multi-Modal Tracking
Viaarxiv icon

Frequency Domain Model Augmentation for Adversarial Attack

Add code
Jul 12, 2022
Figure 1 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 2 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 3 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 4 for Frequency Domain Model Augmentation for Adversarial Attack
Viaarxiv icon

Adaptive Fine-Grained Predicates Learning for Scene Graph Generation

Add code
Jul 11, 2022
Figure 1 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 2 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 3 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 4 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Viaarxiv icon

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Add code
Jun 30, 2022
Figure 1 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 2 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 3 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 4 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Viaarxiv icon

Learning To Generate Scene Graph from Head to Tail

Add code
Jun 23, 2022
Figure 1 for Learning To Generate Scene Graph from Head to Tail
Figure 2 for Learning To Generate Scene Graph from Head to Tail
Figure 3 for Learning To Generate Scene Graph from Head to Tail
Figure 4 for Learning To Generate Scene Graph from Head to Tail
Viaarxiv icon

KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing

Add code
Jun 21, 2022
Figure 1 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 2 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 3 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 4 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Viaarxiv icon

KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences

Add code
Jun 21, 2022
Figure 1 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 2 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 3 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 4 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Viaarxiv icon

From Pixels to Objects: Cubic Visual Attention for Visual Question Answering

Add code
Jun 04, 2022
Figure 1 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 2 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 3 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 4 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Viaarxiv icon

Structured Two-stream Attention Network for Video Question Answering

Add code
Jun 02, 2022
Figure 1 for Structured Two-stream Attention Network for Video Question Answering
Figure 2 for Structured Two-stream Attention Network for Video Question Answering
Figure 3 for Structured Two-stream Attention Network for Video Question Answering
Figure 4 for Structured Two-stream Attention Network for Video Question Answering
Viaarxiv icon

Support-set based Multi-modal Representation Enhancement for Video Captioning

Add code
May 19, 2022
Figure 1 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 2 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 3 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 4 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Viaarxiv icon