"Text": models, code, and papers

MoVideo: Motion-Aware Video Generation with Diffusion Models

Nov 19, 2023
Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan

Via arXiv

Vision-Language Instruction Tuning: A Review and Analysis

Nov 25, 2023
Chen Li, Yixiao Ge, Dian Li, Ying Shan

Via arXiv

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation

Oct 04, 2023
Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu

Via arXiv

D4AM: A General Denoising Framework for Downstream Acoustic Models

Nov 28, 2023
Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen

Via arXiv

Neural machine translation for automated feedback on children's early-stage writing

Nov 15, 2023
Jonas Vestergaard Jensen, Mikkel Jordahn, Michael Riis Andersen

Via arXiv

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Nov 12, 2023
Yilin Zhao, Xinbin Yuan, Shanghua Gao, Zhijie Lin, Qibin Hou, Jiashi Feng, Daquan Zhou

Via arXiv

Evaluation of GPT-4 for chest X-ray impression generation: A reader study on performance and perception

Nov 12, 2023
Sebastian Ziegelmayer, Alexander W. Marka, Nicolas Lenhart, Nadja Nehls, Stefan Reischl, Felix Harder, Andreas Sauter, Marcus Makowski, Markus Graf, Joshua Gawlitza

Via arXiv

Insights into Classifying and Mitigating LLMs' Hallucinations

Nov 14, 2023
Alessandro Bruno, Pier Luigi Mazzeo, Aladine Chetouani, Marouane Tliba, Mohamed Amine Kerkouri

Via arXiv

LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Nov 20, 2023
Gongwei Chen, Leyang Shen, Rui Shao, Xiang Deng, Liqiang Nie

Via arXiv

Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks

Nov 14, 2023
Melanie Mitchell, Alessandro B. Palmarini, Arseny Moskvichev

Via arXiv