"Text": models, code, and papers

Video Referring Expression Comprehension via Transformer with Content-conditioned Query

Oct 25, 2023
Ji Jiang, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou

FiLM: Fill-in Language Models for Any-Order Generation

Oct 15, 2023
Tianxiao Shen, Hao Peng, Ruoqi Shen, Yao Fu, Zaid Harchaoui, Yejin Choi

Audio-Visual Neural Syntax Acquisition

Oct 11, 2023
Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David Cox, David Harwath, Yang Zhang, Karen Livescu, James Glass

Semi-Supervised Panoptic Narrative Grounding

Oct 27, 2023
Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji

Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images

Aug 31, 2023
Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu

Multimodal Graph Learning for Generative Tasks

Oct 12, 2023
Minji Yoon, Jing Yu Koh, Bryan Hooi, Ruslan Salakhutdinov

Improving Summarization with Human Edits

Oct 24, 2023
Zonghai Yao, Benjamin J Schloss, Sai P. Selvaraj

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Oct 24, 2023
Zayne Sprague, Xi Ye, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett

CDSD: Chinese Dysarthria Speech Database

Oct 24, 2023
Mengyi Sun, Ming Gao, Xinchen Kang, Shiru Wang, Jun Du, Dengfeng Yao, Su-Jing Wang

FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering

Oct 24, 2023
Md Rafi Ur Rashid, Vishnu Asutosh Dasu, Kang Gu, Najrin Sultana, Shagufta Mehnaz
