"Text": models, code, and papers

GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023
Xinlu Zhang, Yujie Lu, Weizhi Wang, An Yan, Jun Yan, Lianke Qin, Heng Wang, Xifeng Yan, William Yang Wang, Linda Ruth Petzold

Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer

Oct 05, 2023
Paul-Ambroise Duquenne, Holger Schwenk, Benoît Sagot

Controlling Topic-Focus Articulation in Meaning-to-Text Generation using Graph Neural Networks

Oct 03, 2023
Chunliu Wang, Rik van Noord, Johan Bos

One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion

Nov 14, 2023
Minghua Liu, Ruoxi Shi, Linghao Chen, Zhuoyang Zhang, Chao Xu, Xinyue Wei, Hansheng Chen, Chong Zeng, Jiayuan Gu, Hao Su

On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

Nov 14, 2023
Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda

Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals

Nov 14, 2023
Young-Eun Lee, Seo-Hyun Lee, Soowon Kim, Jung-Sun Lee, Deok-Seon Kim, Seong-Whan Lee

Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

Nov 14, 2023
Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang

Representing visual classification as a linear combination of words

Nov 18, 2023
Shobhit Agarwal, Yevgeniy R. Semenov, William Lotter

An Improved Neural Network Model Based On CNN Using For Fruit Sugar Degree Detection

Nov 18, 2023
Boyang Deng, Xin Wen, Zhan Gao

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Nov 05, 2023
Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu
