Picture for Shen Ge

Shen Ge

Aligning Source Visual and Target Language Domains for Unpaired Video Captioning

Add code
Nov 22, 2022
Figure 1 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 2 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 3 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 4 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Viaarxiv icon

Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Add code
Nov 21, 2022
Figure 1 for Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Figure 2 for Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Figure 3 for Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Figure 4 for Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Viaarxiv icon

DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention

Add code
Oct 28, 2022
Figure 1 for DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
Figure 2 for DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
Figure 3 for DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
Figure 4 for DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
Viaarxiv icon

Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions

Add code
Oct 23, 2022
Figure 1 for Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions
Figure 2 for Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions
Figure 3 for Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions
Figure 4 for Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions
Viaarxiv icon

Competence-based Multimodal Curriculum Learning for Medical Report Generation

Add code
Jun 24, 2022
Figure 1 for Competence-based Multimodal Curriculum Learning for Medical Report Generation
Figure 2 for Competence-based Multimodal Curriculum Learning for Medical Report Generation
Figure 3 for Competence-based Multimodal Curriculum Learning for Medical Report Generation
Figure 4 for Competence-based Multimodal Curriculum Learning for Medical Report Generation
Viaarxiv icon

Graph-in-Graph Network for Automatic Gene Ontology Description Generation

Add code
Jun 10, 2022
Figure 1 for Graph-in-Graph Network for Automatic Gene Ontology Description Generation
Figure 2 for Graph-in-Graph Network for Automatic Gene Ontology Description Generation
Figure 3 for Graph-in-Graph Network for Automatic Gene Ontology Description Generation
Figure 4 for Graph-in-Graph Network for Automatic Gene Ontology Description Generation
Viaarxiv icon

End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

Add code
Apr 29, 2022
Figure 1 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 2 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 3 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 4 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Viaarxiv icon

Hazard Detection And Avoidance For The Nova-C Lander

Add code
Apr 01, 2022
Figure 1 for Hazard Detection And Avoidance For The Nova-C Lander
Figure 2 for Hazard Detection And Avoidance For The Nova-C Lander
Figure 3 for Hazard Detection And Avoidance For The Nova-C Lander
Figure 4 for Hazard Detection And Avoidance For The Nova-C Lander
Viaarxiv icon

AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation

Add code
Mar 18, 2022
Figure 1 for AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation
Figure 2 for AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation
Figure 3 for AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation
Figure 4 for AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation
Viaarxiv icon

Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment

Add code
Dec 30, 2021
Figure 1 for Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment
Figure 2 for Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment
Figure 3 for Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment
Figure 4 for Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment
Viaarxiv icon