"Text": models, code, and papers

Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion

Dec 04, 2023
Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava

Via arXiv

UstanceBR: a multimodal language resource for stance prediction

Dec 11, 2023
Camila Pereira, Matheus Pavan, Sungwon Yoon, Ricelli Ramos, Pablo Costa, Lais Cavalheiro, Ivandre Paraboni

Via arXiv

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

Dec 15, 2023
Ao Zhang, Pan Zhou, Kaixun Huang, Yong Zou, Ming Liu, Lei Xie

Via arXiv

MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation

Dec 15, 2023
Suyi Jiang, Haimin Luo, Haoran Jiang, Ziyu Wang, Jingyi Yu, Lan Xu

Via arXiv

Context Unlocks Emotions: Text-based Emotion Classification Dataset Auditing with Large Language Models

Nov 06, 2023
Daniel Yang, Aditya Kommineni, Mohammad Alshehri, Nilamadhab Mohanty, Vedant Modi, Jonathan Gratch, Shrikanth Narayanan

Via arXiv

WonderJourney: Going from Anywhere to Everywhere

Dec 06, 2023
Hong-Xing Yu, Haoyi Duan, Junhwa Hur, Kyle Sargent, Michael Rubinstein, William T. Freeman, Forrester Cole, Deqing Sun, Noah Snavely, Jiajun Wu, Charles Herrmann

Via arXiv

NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion

Dec 07, 2023
Savva Ignatyev, Daniil Selikhanovych, Oleg Voynov, Yiqun Wang, Peter Wonka, Stamatios Lefkimmiatis, Evgeny Burnaev

Via arXiv

BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text

Oct 31, 2023
Aarohi Srivastava, David Chiang

Via arXiv

AVID: Any-Length Video Inpainting with Diffusion Model

Dec 06, 2023
Zhixing Zhang, Bichen Wu, Xiaoyan Wang, Yaqiao Luo, Luxin Zhang, Yinan Zhao, Peter Vajda, Dimitris Metaxas, Licheng Yu

Via arXiv

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Dec 06, 2023
Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou

Via arXiv