"Text": models, code, and papers

Probabilistically-sound beam search with masked language models

Feb 22, 2024
Charlie Cowen-Breen, Creston Brooks, Robert Calef, Anna Sappington

Whose LLM is it Anyway? Linguistic Comparison and LLM Attribution for GPT-3.5, GPT-4 and Bard

Feb 22, 2024
Ariel Rosenfeld, Teddy Lazebnik

Music Style Transfer with Time-Varying Inversion of Diffusion Models

Feb 21, 2024
Sifei Li, Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming Dong, Changsheng Xu

FlashTex: Fast Relightable Mesh Texturing with LightControlNet

Feb 20, 2024
Kangle Deng, Timothy Omernick, Alexander Weiss, Deva Ramanan, Jun-Yan Zhu, Tinghui Zhou, Maneesh Agrawala

CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting

Feb 21, 2024
Peter Schaldenbrand, Gaurav Parmar, Jun-Yan Zhu, James McCann, Jean Oh

Instruction-Guided Scene Text Recognition

Jan 31, 2024
Yongkun Du, Zhineng Chen, Yuchen Su, Caiyan Jia, Yu-Gang Jiang

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

Feb 27, 2024
Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme

Diffusion Model-Based Image Editing: A Survey

Feb 27, 2024
Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan, Jiaxi Lv, Jianzhuang Liu, Wei Xiong, He Zhang, Shifeng Chen, Liangliang Cao

Visual Style Prompting with Swapping Self-Attention

Feb 21, 2024
Jaeseok Jeong, Junho Kim, Yunjey Choi, Gayoung Lee, Youngjung Uh

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024
Naoyuki Kanda, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Canrun Li, Steven Tsai, Zhen Xiao, Yufei Xia, Jinzhu Li, Yanqing Liu, Sheng Zhao, Michael Zeng
