"Text": models, code, and papers

Med-Flamingo: a Multimodal Medical Few-shot Learner

Jul 27, 2023
Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, Cyril Zakka, Yash Dalmia, Eduardo Pontes Reis, Pranav Rajpurkar, Jure Leskovec

The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech

Jun 01, 2023
Phat Do, Matt Coler, Jelske Dijkstra, Esther Klabbers

Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative Model

Apr 28, 2023
Shishi Xiao, Suizi Huang, Yue Lin, Yilin Ye, Wei Zeng

Learning when to observe: A frugal reinforcement learning framework for a high-cost world

Jul 24, 2023
Colin Bellinger, Mark Crowley, Isaac Tamblyn

PRIOR: Prototype Representation Joint Learning from Medical Images and Reports

Jul 24, 2023
Pujin Cheng, Li Lin, Junyan Lyu, Yijin Huang, Wenhan Luo, Xiaoying Tang

Enhancing Human-like Multi-Modal Reasoning: A New Challenging Dataset and Comprehensive Framework

Jul 24, 2023
Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

Performance of Large Language Models in a Computer Science Degree Program

Jul 24, 2023
Tim Krüger, Michael Gref

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion

Jul 13, 2023
Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia

Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models

Apr 18, 2023
Stephen Brade, Bryan Wang, Mauricio Sousa, Sageev Oore, Tovi Grossman

MultiQG-TI: Towards Question Generation from Multi-modal Sources

Jul 07, 2023
Zichao Wang, Richard Baraniuk
