Alert button
Picture for Long Chen

Long Chen

Alert button

Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives

Add code
Bookmark button
Alert button
Apr 25, 2022
Shaoning Xiao, Long Chen, Kaifeng Gao, Zhao Wang, Yi Yang, Jun Xiao

Figure 1 for Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives
Figure 2 for Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives
Figure 3 for Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives
Figure 4 for Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives
Viaarxiv icon

Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network

Add code
Bookmark button
Alert button
Apr 20, 2022
Yuhang He, Lin Chen, Junkun Xie, Long Chen

Figure 1 for Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network
Figure 2 for Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network
Figure 3 for Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network
Figure 4 for Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network
Viaarxiv icon

Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

Add code
Bookmark button
Alert button
Apr 19, 2022
Justin Baker, Hedi Xia, Yiwei Wang, Elena Cherkaev, Akil Narayan, Long Chen, Jack Xin, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

Figure 1 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 2 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 3 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 4 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Viaarxiv icon

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting

Add code
Bookmark button
Alert button
Apr 16, 2022
Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Rama Chellappa, Shih-Fu Chang

Figure 1 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 2 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 3 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 4 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Viaarxiv icon

Few-Shot Object Detection with Fully Cross-Transformer

Add code
Bookmark button
Alert button
Mar 28, 2022
Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang

Figure 1 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 2 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 3 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 4 for Few-Shot Object Detection with Fully Cross-Transformer
Viaarxiv icon

SATr: Slice Attention with Transformer for Universal Lesion Detection

Add code
Bookmark button
Alert button
Mar 13, 2022
Han Li, Long Chen, Hu Han, S. Kevin Zhou

Figure 1 for SATr: Slice Attention with Transformer for Universal Lesion Detection
Figure 2 for SATr: Slice Attention with Transformer for Universal Lesion Detection
Figure 3 for SATr: Slice Attention with Transformer for Universal Lesion Detection
Figure 4 for SATr: Slice Attention with Transformer for Universal Lesion Detection
Viaarxiv icon

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach

Add code
Bookmark button
Alert button
Mar 10, 2022
Xiaohan Lan, Yitian Yuan, Xin Wang, Long Chen, Zhi Wang, Lin Ma, Wenwu Zhu

Figure 1 for A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Figure 2 for A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Figure 3 for A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Figure 4 for A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Viaarxiv icon

openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer

Add code
Bookmark button
Alert button
Feb 24, 2022
Kishan K C, Zhenning Tan, Long Chen, Minho Jin, Eunjung Han, Andreas Stolcke, Chul Lee

Figure 1 for openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer
Figure 2 for openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer
Figure 3 for openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer
Figure 4 for openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer
Viaarxiv icon

Rethinking the Two-Stage Framework for Grounded Situation Recognition

Add code
Bookmark button
Alert button
Dec 10, 2021
Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Figure 1 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 2 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 3 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 4 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Viaarxiv icon

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

Add code
Bookmark button
Alert button
Dec 08, 2021
Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao

Figure 1 for Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Figure 2 for Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Figure 3 for Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Figure 4 for Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Viaarxiv icon