Picture for Bin Sun

Bin Sun

Member, IEEE

Learning to Locate Visual Answer in Video Corpus Using Question

Add code
Oct 11, 2022
Figure 1 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 2 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 3 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 4 for Learning to Locate Visual Answer in Video Corpus Using Question
Viaarxiv icon

TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking

Add code
Jul 26, 2022
Figure 1 for TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking
Figure 2 for TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking
Figure 3 for TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking
Figure 4 for TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking
Viaarxiv icon

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation

Add code
Jul 05, 2022
Figure 1 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 2 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 3 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 4 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Viaarxiv icon

Explicit and implicit models in infrared and visible image fusion

Add code
Jun 20, 2022
Figure 1 for Explicit and implicit models in infrared and visible image fusion
Figure 2 for Explicit and implicit models in infrared and visible image fusion
Viaarxiv icon

Towards Layer-wise Image Vectorization

Add code
Jun 09, 2022
Figure 1 for Towards Layer-wise Image Vectorization
Figure 2 for Towards Layer-wise Image Vectorization
Figure 3 for Towards Layer-wise Image Vectorization
Figure 4 for Towards Layer-wise Image Vectorization
Viaarxiv icon

Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning

Add code
May 23, 2022
Figure 1 for Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Figure 2 for Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Figure 3 for Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Figure 4 for Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Viaarxiv icon

Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild

Add code
May 10, 2022
Figure 1 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 2 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 3 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 4 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Viaarxiv icon

Diversifying Neural Dialogue Generation via Negative Distillation

Add code
May 05, 2022
Figure 1 for Diversifying Neural Dialogue Generation via Negative Distillation
Figure 2 for Diversifying Neural Dialogue Generation via Negative Distillation
Figure 3 for Diversifying Neural Dialogue Generation via Negative Distillation
Figure 4 for Diversifying Neural Dialogue Generation via Negative Distillation
Viaarxiv icon

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs

Add code
Apr 20, 2022
Figure 1 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 2 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 3 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 4 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Viaarxiv icon

Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video

Add code
Mar 29, 2022
Figure 1 for Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Figure 2 for Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Figure 3 for Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Figure 4 for Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Viaarxiv icon