Alert button

"Text": models, code, and papers
Alert button

LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Nov 20, 2023
Gongwei Chen, Leyang Shen, Rui Shao, Xiang Deng, Liqiang Nie

Viaarxiv icon

Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks

Nov 14, 2023
Melanie Mitchell, Alessandro B. Palmarini, Arseny Moskvichev

Figure 1 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 2 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 3 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 4 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Viaarxiv icon

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

Oct 05, 2023
Mihir Prabhudesai, Anirudh Goyal, Deepak Pathak, Katerina Fragkiadaki

Figure 1 for Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Figure 2 for Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Figure 3 for Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Figure 4 for Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Viaarxiv icon

ASPIRO: Any-shot Structured Parsing-error-Induced ReprOmpting for Consistent Data-to-Text Generation

Oct 27, 2023
Martin Vejvar, Yasutaka Fujimoto

Viaarxiv icon

GAIA: Zero-shot Talking Avatar Generation

Nov 26, 2023
Tianyu He, Junliang Guo, Runyi Yu, Yuchi Wang, Jialiang Zhu, Kaikai An, Leyi Li, Xu Tan, Chunyu Wang, Han Hu, HsiangTao Wu, Sheng Zhao, Jiang Bian

Figure 1 for GAIA: Zero-shot Talking Avatar Generation
Figure 2 for GAIA: Zero-shot Talking Avatar Generation
Figure 3 for GAIA: Zero-shot Talking Avatar Generation
Figure 4 for GAIA: Zero-shot Talking Avatar Generation
Viaarxiv icon

PISA: Point-cloud-based Instructed Scene Augmentation

Nov 26, 2023
Yiyang Luo, Ke Lin

Figure 1 for PISA: Point-cloud-based Instructed Scene Augmentation
Figure 2 for PISA: Point-cloud-based Instructed Scene Augmentation
Figure 3 for PISA: Point-cloud-based Instructed Scene Augmentation
Figure 4 for PISA: Point-cloud-based Instructed Scene Augmentation
Viaarxiv icon

Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Oct 17, 2023
Tomohito Kasahara, Daisuke Kawahara

Viaarxiv icon

Extracting Definienda in Mathematical Scholarly Articles with Transformers

Nov 21, 2023
Shufan Jiang, Pierre Senellart

Viaarxiv icon

Accuracy of a Vision-Language Model on Challenging Medical Cases

Nov 09, 2023
Thomas Buckley, James A. Diao, Adam Rodman, Arjun K. Manrai

Viaarxiv icon

AutoKG: Efficient Automated Knowledge Graph Generation for Language Models

Nov 22, 2023
Bohan Chen, Andrea L. Bertozzi

Viaarxiv icon