Alert button

"Text": models, code, and papers
Alert button

CLoVe: Encoding Compositional Language in Contrastive Vision-Language Models

Mar 01, 2024
Santiago Castro, Amir Ziai, Avneesh Saluja, Zhuoning Yuan, Rada Mihalcea

Viaarxiv icon

Multi-Intent Attribute-Aware Text Matching in Searching

Feb 12, 2024
Mingzhe Li, Xiuying Chen, Jing Xiang, Qishen Zhang, Changsheng Ma, Chenchen Dai, Jinxiong Chang, Zhongyi Liu, Guannan Zhang

Viaarxiv icon

NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents

Feb 27, 2024
Tamara Czinczoll, Christoph Hönes, Maximilian Schall, Gerard de Melo

Viaarxiv icon

Word Order and World Knowledge

Mar 01, 2024
Qinghua Zhao, Vinit Ravishankar, Nicolas Garneau, Anders Søgaard

Figure 1 for Word Order and World Knowledge
Figure 2 for Word Order and World Knowledge
Figure 3 for Word Order and World Knowledge
Figure 4 for Word Order and World Knowledge
Viaarxiv icon

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion

Feb 16, 2024
Hila Manor, Tomer Michaeli

Viaarxiv icon

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Mar 02, 2024
Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Figure 1 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 2 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 3 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 4 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Viaarxiv icon

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Feb 27, 2024
Zhenting Qi, Hanlin Zhang, Eric Xing, Sham Kakade, Himabindu Lakkaraju

Viaarxiv icon

Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction

Feb 27, 2024
Senjuti Dutta, Sherol Chen, Sunny Mak, Amnah Ahmad, Katherine Collins, Alena Butryna, Deepak Ramachandran, Krishnamurthy Dvijotham, Ellie Pavlick, Ravi Rajakumar

Figure 1 for Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Figure 2 for Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Figure 3 for Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Figure 4 for Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Viaarxiv icon

Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection

Mar 03, 2024
Sam Dauncey, Chris Holmes, Christopher Williams, Fabian Falck

Figure 1 for Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection
Figure 2 for Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection
Figure 3 for Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection
Figure 4 for Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection
Viaarxiv icon

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Feb 15, 2024
Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Viaarxiv icon