Alert button

"Text": models, code, and papers
Alert button

Controlling Topic-Focus Articulation in Meaning-to-Text Generation using Graph Neural Networks

Oct 03, 2023
Chunliu Wang, Rik van Noord, Johan Bos

Viaarxiv icon

One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion

Nov 14, 2023
Minghua Liu, Ruoxi Shi, Linghao Chen, Zhuoyang Zhang, Chao Xu, Xinyue Wei, Hansheng Chen, Chong Zeng, Jiayuan Gu, Hao Su

Viaarxiv icon

On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

Nov 14, 2023
Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda

Viaarxiv icon

Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals

Nov 14, 2023
Young-Eun Lee, Seo-Hyun Lee, Soowon Kim, Jung-Sun Lee, Deok-Seon Kim, Seong-Whan Lee

Figure 1 for Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals
Figure 2 for Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals
Viaarxiv icon

Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

Nov 14, 2023
Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang

Figure 1 for Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models
Figure 2 for Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models
Figure 3 for Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models
Figure 4 for Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models
Viaarxiv icon

Representing visual classification as a linear combination of words

Nov 18, 2023
Shobhit Agarwal, Yevgeniy R. Semenov, William Lotter

Figure 1 for Representing visual classification as a linear combination of words
Figure 2 for Representing visual classification as a linear combination of words
Figure 3 for Representing visual classification as a linear combination of words
Figure 4 for Representing visual classification as a linear combination of words
Viaarxiv icon

An Improved Neural Network Model Based On CNN Using For Fruit Sugar Degree Detection

Nov 18, 2023
Boyang Deng, Xin Wen, Zhan Gao

Viaarxiv icon

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Nov 05, 2023
Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu

Viaarxiv icon

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing

Sep 27, 2023
Brian Yan, Xuankai Chang, Antonios Anastasopoulos, Yuya Fujita, Shinji Watanabe

Figure 1 for Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing
Figure 2 for Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing
Figure 3 for Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing
Figure 4 for Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing
Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Nov 03, 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao

Viaarxiv icon