Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders



Gang Li , Heliang Zheng , Daqing Liu , Chaoyue Wang , Bing Su , Changwen Zheng

* 12 pages, 3 figures 

   Access Paper or Ask Questions

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer



Jiajun Deng , Zhengyuan Yang , Daqing Liu , Tianlang Chen , Wengang Zhou , Yanyong Zhang , Houqiang Li , Wanli Ouyang

* arXiv admin note: text overlap with arXiv:2104.08541 

   Access Paper or Ask Questions

Modeling Image Composition for Complex Scene Generation



Zuopeng Yang , Daqing Liu , Chaoyue Wang , Jie Yang , Dacheng Tao

* CVPR 2022 

   Access Paper or Ask Questions

Compact Bidirectional Transformer for Image Captioning



Yuanen Zhou , Zhenzhen Hu , Daqing Liu , Huixia Ben , Meng Wang


   Access Paper or Ask Questions

Learning to Discretely Compose Reasoning Module Networks for Video Captioning



Ganchao Tan , Daqing Liu , Meng Wang , Zheng-Jun Zha

* IJCAI 2020, Pages 745-752 
* Accepted at IJCAI 2020 Main Track. Sole copyright holder is IJCAI. Code is available at https://github.com/tgc1997/RMN 

   Access Paper or Ask Questions

More Grounded Image Captioning by Distilling Image-Text Matching Model



Yuanen Zhou , Meng Wang , Daqing Liu , Zhenzhen Hu , Hanwang Zhang

* Accepted by CVPR 2020 

   Access Paper or Ask Questions

Referring Expression Grounding by Marginalizing Scene Graph Likelihood



Daqing Liu , Hanwang Zhang , Zheng-Jun Zha , Fanglin Wang

* Submitted to NeurIPS 2019 

   Access Paper or Ask Questions

Context-Aware Visual Policy Network for Fine-Grained Image Captioning



Zheng-Jun Zha , Daqing Liu , Hanwang Zhang , Yongdong Zhang , Feng Wu

* Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Extended version of "Context-Aware Visual Policy Network for Sequence-Level Image Captioning", ACM MM 2018 (arXiv:1808.05864) 

   Access Paper or Ask Questions

Learning to Compose and Reason with Language Tree Structures for Visual Grounding



Richang Hong , Daqing Liu , Xiaoyu Mo , Xiangnan He , Hanwang Zhang

* Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) 

   Access Paper or Ask Questions

1
2
>>