Alert button

"Text": models, code, and papers
Alert button

ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Nov 30, 2023
Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong

Viaarxiv icon

Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents

Dec 21, 2023
Jing Li, Qiu-Feng Wang, Kaizhu Huang, Rui Zhang, Siyuan Wang

Viaarxiv icon

Enhancing Medical Text Evaluation with GPT-4

Nov 16, 2023
Yiqing Xie, Sheng Zhang, Hao Cheng, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon

Viaarxiv icon

SECap: Speech Emotion Captioning with Large Language Model

Dec 23, 2023
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shixiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu

Viaarxiv icon

Theory of Hallucinations based on Equivariance

Dec 22, 2023
Hisaichi Shibata

Viaarxiv icon

Quantifying the redundancy between prosody and text

Nov 28, 2023
Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev

Viaarxiv icon

PowMix: A Versatile Regularizer for Multimodal Sentiment Analysis

Dec 19, 2023
Efthymios Georgiou, Yannis Avrithis, Alexandros Potamianos

Viaarxiv icon

LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching

Nov 22, 2023
Yixun Liang, Xin Yang, Jiantao Lin, Haodong Li, Xiaogang Xu, Yingcong Chen

Figure 1 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 2 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 3 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 4 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Viaarxiv icon

How Robust are LLMs to In-Context Majority Label Bias?

Dec 27, 2023
Karan Gupta, Sumegh Roychowdhury, Siva Rajesh Kasa, Santhosh Kumar Kasa, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

Viaarxiv icon

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Dec 14, 2023
Jinguo Zhu, Xiaohan Ding, Yixiao Ge, Yuying Ge, Sijie Zhao, Hengshuang Zhao, Xiaohua Wang, Ying Shan

Viaarxiv icon