"Text": models, code, and papers
NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion

Dec 07, 2023
Savva Ignatyev, Daniil Selikhanovych, Oleg Voynov, Yiqun Wang, Peter Wonka, Stamatios Lefkimmiatis, Evgeny Burnaev

WonderJourney: Going from Anywhere to Everywhere

Dec 06, 2023
Hong-Xing Yu, Haoyi Duan, Junhwa Hur, Kyle Sargent, Michael Rubinstein, William T. Freeman, Forrester Cole, Deqing Sun, Noah Snavely, Jiajun Wu, Charles Herrmann

Context Unlocks Emotions: Text-based Emotion Classification Dataset Auditing with Large Language Models

Nov 06, 2023
Daniel Yang, Aditya Kommineni, Mohammad Alshehri, Nilamadhab Mohanty, Vedant Modi, Jonathan Gratch, Shrikanth Narayanan

AVID: Any-Length Video Inpainting with Diffusion Model

Dec 06, 2023
Zhixing Zhang, Bichen Wu, Xiaoyan Wang, Yaqiao Luo, Luxin Zhang, Yinan Zhao, Peter Vajda, Dimitris Metaxas, Licheng Yu

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Dec 06, 2023
Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou

Efficient Quantization Strategies for Latent Diffusion Models

Dec 09, 2023
Yuewei Yang, Xiaoliang Dai, Jialiang Wang, Peizhao Zhang, Hongbo Zhang

Extending Whisper with prompt tuning to target-speaker ASR

Dec 13, 2023
Hao Ma, Zhiyuan Peng, Mingjie Shao, Jing Li, Ju Liu

BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text

Oct 31, 2023
Aarohi Srivastava, David Chiang

DER-GCN: Dialogue and Event Relation-Aware Graph Convolutional Neural Network for Multimodal Dialogue Emotion Recognition

Dec 17, 2023
Wei Ai, Yuntao Shou, Tao Meng, Keqin Li

Motion Flow Matching for Human Motion Synthesis and Editing

Dec 14, 2023
Vincent Tao Hu, Wenzhe Yin, Pingchuan Ma, Yunlu Chen, Basura Fernando, Yuki M Asano, Efstratios Gavves, Pascal Mettes, Bjorn Ommer, Cees G. M. Snoek
