Alert button

"Text": models, code, and papers
Alert button

Text-To-Concept (and Back) via Cross-Model Alignment

May 10, 2023
Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi

Figure 1 for Text-To-Concept (and Back) via Cross-Model Alignment
Figure 2 for Text-To-Concept (and Back) via Cross-Model Alignment
Figure 3 for Text-To-Concept (and Back) via Cross-Model Alignment
Figure 4 for Text-To-Concept (and Back) via Cross-Model Alignment
Viaarxiv icon

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data

Aug 20, 2023
Yanda Li, Chi Zhang, Gang Yu, Zhibin Wang, Bin Fu, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei

Figure 1 for StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
Figure 2 for StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
Figure 3 for StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
Figure 4 for StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
Viaarxiv icon

Conditional Score Guidance for Text-Driven Image-to-Image Translation

May 29, 2023
Hyunsoo Lee, Minsoo Kang, Bohyung Han

Figure 1 for Conditional Score Guidance for Text-Driven Image-to-Image Translation
Figure 2 for Conditional Score Guidance for Text-Driven Image-to-Image Translation
Figure 3 for Conditional Score Guidance for Text-Driven Image-to-Image Translation
Figure 4 for Conditional Score Guidance for Text-Driven Image-to-Image Translation
Viaarxiv icon

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields

May 19, 2023
Jingbo Zhang, Xiaoyu Li, Ziyu Wan, Can Wang, Jing Liao

Figure 1 for Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Figure 2 for Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Figure 3 for Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Figure 4 for Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Viaarxiv icon

Zero-Shot Text Classification via Self-Supervised Tuning

May 19, 2023
Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

Figure 1 for Zero-Shot Text Classification via Self-Supervised Tuning
Figure 2 for Zero-Shot Text Classification via Self-Supervised Tuning
Figure 3 for Zero-Shot Text Classification via Self-Supervised Tuning
Figure 4 for Zero-Shot Text Classification via Self-Supervised Tuning
Viaarxiv icon

LEAP: Efficient and Automated Test Method for NLP Software

Aug 22, 2023
Mingxuan Xiao, Yan Xiao, Hai Dong, Shunhui Ji, Pengcheng Zhang

Figure 1 for LEAP: Efficient and Automated Test Method for NLP Software
Figure 2 for LEAP: Efficient and Automated Test Method for NLP Software
Figure 3 for LEAP: Efficient and Automated Test Method for NLP Software
Figure 4 for LEAP: Efficient and Automated Test Method for NLP Software
Viaarxiv icon

Foundation Model is Efficient Multimodal Multitask Model Selector

Aug 11, 2023
Fanqing Meng, Wenqi Shao, Zhanglin Peng, Chonghe Jiang, Kaipeng Zhang, Yu Qiao, Ping Luo

Figure 1 for Foundation Model is Efficient Multimodal Multitask Model Selector
Figure 2 for Foundation Model is Efficient Multimodal Multitask Model Selector
Figure 3 for Foundation Model is Efficient Multimodal Multitask Model Selector
Figure 4 for Foundation Model is Efficient Multimodal Multitask Model Selector
Viaarxiv icon

SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds

Jun 03, 2023
Yanyu Li, Huan Wang, Qing Jin, Ju Hu, Pavlo Chemerys, Yun Fu, Yanzhi Wang, Sergey Tulyakov, Jian Ren

Figure 1 for SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Figure 2 for SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Figure 3 for SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Figure 4 for SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Viaarxiv icon

Scalable Mask Annotation for Video Text Spotting

May 02, 2023
Haibin He, Jing Zhang, Mengyang Xu, Juhua Liu, Bo Du, Dacheng Tao

Figure 1 for Scalable Mask Annotation for Video Text Spotting
Figure 2 for Scalable Mask Annotation for Video Text Spotting
Figure 3 for Scalable Mask Annotation for Video Text Spotting
Figure 4 for Scalable Mask Annotation for Video Text Spotting
Viaarxiv icon

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

Jun 27, 2023
Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma

Figure 1 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 2 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 3 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 4 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Viaarxiv icon