Alert button

"Text": models, code, and papers
Alert button

SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing

May 30, 2023
Nazmul Karim, Umar Khalid, Mohsen Joneidi, Chen Chen, Nazanin Rahnavard

Figure 1 for SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing
Figure 2 for SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing
Figure 3 for SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing
Figure 4 for SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing
Viaarxiv icon

Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Jun 01, 2023
Sameer Jain, Vaishakh Keshava, Swarnashree Mysore Sathyendra, Patrick Fernandes, Pengfei Liu, Graham Neubig, Chunting Zhou

Figure 1 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
Figure 2 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
Figure 3 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
Figure 4 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
Viaarxiv icon

HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance

May 31, 2023
Junzhe Zhu, Peiye Zhuang

Figure 1 for HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Figure 2 for HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Figure 3 for HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Figure 4 for HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Viaarxiv icon

Masked and Permuted Implicit Context Learning for Scene Text Recognition

May 25, 2023
Xiaomeng Yang, Zhi Qiao, Jin Wei, Yu Zhou, Ye Yuan, Zhilong Ji, Dongbao Yang, Weiping Wang

Figure 1 for Masked and Permuted Implicit Context Learning for Scene Text Recognition
Figure 2 for Masked and Permuted Implicit Context Learning for Scene Text Recognition
Figure 3 for Masked and Permuted Implicit Context Learning for Scene Text Recognition
Figure 4 for Masked and Permuted Implicit Context Learning for Scene Text Recognition
Viaarxiv icon

Classifying Dementia in the Presence of Depression: A Cross-Corpus Study

Aug 16, 2023
Franziska Braun, Sebastian P. Bayerl, Paula A. Pérez-Toro, Florian Hönig, Hartmut Lehfeld, Thomas Hillemacher, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer

Figure 1 for Classifying Dementia in the Presence of Depression: A Cross-Corpus Study
Figure 2 for Classifying Dementia in the Presence of Depression: A Cross-Corpus Study
Viaarxiv icon

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

May 25, 2023
Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong

Figure 1 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 2 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 3 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 4 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Viaarxiv icon

Pre-Trained Large Language Models for Industrial Control

Aug 06, 2023
Lei Song, Chuheng Zhang, Li Zhao, Jiang Bian

Viaarxiv icon

Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models

Aug 18, 2023
Navid Rajabi, Jana Kosecka

Figure 1 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 2 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 3 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 4 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Viaarxiv icon

ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

May 23, 2023
William Brannon, Suyash Fulay, Hang Jiang, Wonjune Kang, Brandon Roy, Jad Kabbara, Deb Roy

Figure 1 for ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
Figure 2 for ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
Figure 3 for ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
Figure 4 for ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
Viaarxiv icon

Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models

May 23, 2023
Dong Wei, Xiaoning Sun, Huaijiang Sun, Bin Li, Shengxiang Hu, Weiqing Li, Jianfeng Lu

Figure 1 for Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Figure 2 for Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Figure 3 for Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Figure 4 for Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Viaarxiv icon