Daqing Liu

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

Jun 14, 2022
Via arXiv

Modeling Image Composition for Complex Scene Generation

Jun 02, 2022
Via arXiv

Compact Bidirectional Transformer for Image Captioning

Jan 06, 2022
Via arXiv

Learning to Discretely Compose Reasoning Module Networks for Video Captioning

Jul 17, 2020
Via arXiv

More Grounded Image Captioning by Distilling Image-Text Matching Model

Apr 01, 2020
Via arXiv

Referring Expression Grounding by Marginalizing Scene Graph Likelihood

Jun 09, 2019
Via arXiv

Context-Aware Visual Policy Network for Fine-Grained Image Captioning

Jun 06, 2019
Via arXiv

Learning to Compose and Reason with Language Tree Structures for Visual Grounding

Jun 05, 2019
Via arXiv

Explainability by Parsing: Neural Module Tree Networks for Natural Language Visual Grounding

Dec 08, 2018
Via arXiv

Context-Aware Visual Policy Network for Sequence-Level Image Captioning

Aug 22, 2018
Via arXiv