Alert button

"Text": models, code, and papers
Alert button

Do large language models and humans have similar behaviors in causal inference with script knowledge?

Nov 13, 2023
Xudong Hong, Margarita Ryzhova, Daniel Adrian Biondi, Vera Demberg

Viaarxiv icon

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Oct 25, 2023
Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi

Figure 1 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 2 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 3 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 4 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Viaarxiv icon

BERT Lost Patience Won't Be Robust to Adversarial Slowdown

Oct 29, 2023
Zachary Coalson, Gabriel Ritter, Rakesh Bobba, Sanghyun Hong

Viaarxiv icon

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Nov 14, 2023
Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

Viaarxiv icon

Quality-Diversity through AI Feedback

Oct 31, 2023
Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

Viaarxiv icon

What's In My Big Data?

Oct 31, 2023
Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge

Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Videoprompter: an ensemble of foundational models for zero-shot video understanding

Oct 23, 2023
Adeel Yousaf, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

Figure 1 for Videoprompter: an ensemble of foundational models for zero-shot video understanding
Figure 2 for Videoprompter: an ensemble of foundational models for zero-shot video understanding
Figure 3 for Videoprompter: an ensemble of foundational models for zero-shot video understanding
Figure 4 for Videoprompter: an ensemble of foundational models for zero-shot video understanding
Viaarxiv icon

In-Context Prompt Editing For Conditional Audio Generation

Nov 01, 2023
Ernie Chang, Pin-Jie Lin, Yang Li, Sidd Srinivasan, Gael Le Lan, David Kant, Yangyang Shi, Forrest Iandola, Vikas Chandra

Viaarxiv icon

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning

Sep 20, 2023
Chen Jiang, Hong Liu, Xuzheng Yu, Qing Wang, Yuan Cheng, Jia Xu, Zhongyi Liu, Qingpei Guo, Wei Chu, Ming Yang, Yuan Qi

Figure 1 for Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Figure 2 for Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Figure 3 for Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Figure 4 for Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Viaarxiv icon

FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization

Nov 03, 2023
Nan Zhang, Yusen Zhang, Wu Guo, Prasenjit Mitra, Rui Zhang

Viaarxiv icon