Alert button
Picture for Jingkuan Song

Jingkuan Song

Alert button

Prompting for Multi-Modal Tracking

Aug 01, 2022
Jinyu Yang, Zhe Li, Feng Zheng, Aleš Leonardis, Jingkuan Song

Figure 1 for Prompting for Multi-Modal Tracking
Figure 2 for Prompting for Multi-Modal Tracking
Figure 3 for Prompting for Multi-Modal Tracking
Figure 4 for Prompting for Multi-Modal Tracking
Viaarxiv icon

Frequency Domain Model Augmentation for Adversarial Attack

Jul 12, 2022
Yuyang Long, Qilong Zhang, Boheng Zeng, Lianli Gao, Xianglong Liu, Jian Zhang, Jingkuan Song

Figure 1 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 2 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 3 for Frequency Domain Model Augmentation for Adversarial Attack
Figure 4 for Frequency Domain Model Augmentation for Adversarial Attack
Viaarxiv icon

Adaptive Fine-Grained Predicates Learning for Scene Graph Generation

Jul 11, 2022
Xinyu Lyu, Lianli Gao, Pengpeng Zeng, Heng Tao Shen, Jingkuan Song

Figure 1 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 2 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 3 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Figure 4 for Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Viaarxiv icon

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Jun 30, 2022
Xuanhan Wang, Yan Dai, Lianli Gao, Jingkuan Song

Figure 1 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 2 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 3 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 4 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Viaarxiv icon

Learning To Generate Scene Graph from Head to Tail

Jun 23, 2022
Chaofan Zheng, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao

Figure 1 for Learning To Generate Scene Graph from Head to Tail
Figure 2 for Learning To Generate Scene Graph from Head to Tail
Figure 3 for Learning To Generate Scene Graph from Head to Tail
Figure 4 for Learning To Generate Scene Graph from Head to Tail
Viaarxiv icon

KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing

Jun 21, 2022
Xuanhan Wang, Jingkuan Song, Xiaojia Chen, Lechao Cheng, Lianli Gao, Heng Tao Shen

Figure 1 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 2 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 3 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 4 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Viaarxiv icon

KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences

Jun 21, 2022
Xuanhan Wang, Lianli Gao, Yixuan Zhou, Jingkuan Song, Meng Wang

Figure 1 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 2 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 3 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 4 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Viaarxiv icon

From Pixels to Objects: Cubic Visual Attention for Visual Question Answering

Jun 04, 2022
Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen

Figure 1 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 2 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 3 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Figure 4 for From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Viaarxiv icon

Structured Two-stream Attention Network for Video Question Answering

Jun 02, 2022
Lianli Gao, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, Heng Tao Shen

Figure 1 for Structured Two-stream Attention Network for Video Question Answering
Figure 2 for Structured Two-stream Attention Network for Video Question Answering
Figure 3 for Structured Two-stream Attention Network for Video Question Answering
Figure 4 for Structured Two-stream Attention Network for Video Question Answering
Viaarxiv icon

Support-set based Multi-modal Representation Enhancement for Video Captioning

May 19, 2022
Xiaoya Chen, Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen

Figure 1 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 2 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 3 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Figure 4 for Support-set based Multi-modal Representation Enhancement for Video Captioning
Viaarxiv icon