Alert button

"Text": models, code, and papers
Alert button

Contextual Expressive Text-to-Speech

Nov 26, 2022
Jianhong Tu, Zeyu Cui, Xiaohuan Zhou, Siqi Zheng, Kai Hu, Ju Fan, Chang Zhou

Figure 1 for Contextual Expressive Text-to-Speech
Figure 2 for Contextual Expressive Text-to-Speech
Figure 3 for Contextual Expressive Text-to-Speech
Viaarxiv icon

AutoCoreset: An Automatic Practical Coreset Construction Framework

May 19, 2023
Alaa Maalouf, Murad Tukan, Vladimir Braverman, Daniela Rus

Figure 1 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 2 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 3 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 4 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Viaarxiv icon

Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT

Apr 29, 2023
Zhenxiang Xiao, Yuzhong Chen, Lu Zhang, Junjie Yao, Zihao Wu, Xiaowei Yu, Yi Pan, Lin Zhao, Chong Ma, Xinyu Liu, Wei Liu, Xiang Li, Yixuan Yuan, Dinggang Shen, Dajiang Zhu, Tianming Liu, Xi Jiang

Figure 1 for Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Figure 2 for Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Figure 3 for Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Figure 4 for Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Viaarxiv icon

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention

Apr 29, 2023
Xiao Liu, Jian Zhang, Heng Zhang, Fuzhao Xue, Yang You

Figure 1 for Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention
Figure 2 for Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention
Figure 3 for Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention
Figure 4 for Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention
Viaarxiv icon

Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints

Feb 17, 2023
Albert Lu, Hongxin Zhang, Yanzhe Zhang, Xuezhi Wang, Diyi Yang

Figure 1 for Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Figure 2 for Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Figure 3 for Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Figure 4 for Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Viaarxiv icon

DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting

Nov 23, 2022
Maoyuan Ye, Jing Zhang, Shanshan Zhao, Juhua Liu, Tongliang Liu, Bo Du, Dacheng Tao

Figure 1 for DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Figure 2 for DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Figure 3 for DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Figure 4 for DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Viaarxiv icon

Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources

May 13, 2023
Suraj Rajendran, Weishen Pan, Mert R. Sabuncu, Yong Chen, Jiayu Zhou, Fei Wang

Figure 1 for Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources
Figure 2 for Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources
Figure 3 for Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources
Figure 4 for Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources
Viaarxiv icon

Interpreting Vision and Language Generative Models with Semantic Visual Priors

May 04, 2023
Michele Cafagna, Lina M. Rojas-Barahona, Kees van Deemter, Albert Gatt

Figure 1 for Interpreting Vision and Language Generative Models with Semantic Visual Priors
Figure 2 for Interpreting Vision and Language Generative Models with Semantic Visual Priors
Figure 3 for Interpreting Vision and Language Generative Models with Semantic Visual Priors
Figure 4 for Interpreting Vision and Language Generative Models with Semantic Visual Priors
Viaarxiv icon

Self-conditioned Embedding Diffusion for Text Generation

Nov 08, 2022
Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond

Figure 1 for Self-conditioned Embedding Diffusion for Text Generation
Figure 2 for Self-conditioned Embedding Diffusion for Text Generation
Figure 3 for Self-conditioned Embedding Diffusion for Text Generation
Figure 4 for Self-conditioned Embedding Diffusion for Text Generation
Viaarxiv icon

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Dec 22, 2022
Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou

Figure 1 for Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Figure 2 for Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Figure 3 for Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Figure 4 for Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Viaarxiv icon