"Text": models, code, and papers

Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes

Aug 17, 2023
Zehan Wang, Haifeng Huang, Yang Zhao, Ziang Zhang, Zhou Zhao

RepCL: Exploring Effective Representation for Continual Text Classification

May 12, 2023
Yifan Song, Peiyi Wang, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li

Unsupervised Improvement of Audio-Text Cross-Modal Representations

May 05, 2023
Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fabio Ayres, Paris Smaragdis

A Neural Space-Time Representation for Text-to-Image Personalization

May 24, 2023
Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen-Or

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

Aug 07, 2023
Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

A Frustratingly Simple Decoding Method for Neural Text Generation

May 22, 2023
Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam, Shuming Shi

PUMGPT: A Large Vision-Language Model for Product Understanding

Aug 18, 2023
Shuhui Wu, Zengming Tang, Zongyi Guo, Weiwei Zhang, Baoliang Cui, Haihong Tang, Weiming Lu

Language-Guided Diffusion Model for Visual Grounding

Aug 18, 2023
Sijia Chen, Baochun Li

Accelerated materials language processing enabled by GPT

Aug 18, 2023
Jaewoong Choi, Byungju Lee

Towards Consistent Video Editing with Text-to-Image Diffusion Models

May 27, 2023
Zicheng Zhang, Bonan Li, Xuecheng Nie, Congying Han, Tiande Guo, Luoqi Liu
