"Information": models, code, and papers

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Dec 05, 2023
Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro

Robust UAV Position and Attitude Estimation using Multiple GNSS Receivers for Laser-based 3D Mapping

Dec 05, 2023
Taro Suzuki, Daichi Inoue, Yoshiharu Amano

Learning optimal integration of spatial and temporal information in noisy chemotaxis

Oct 16, 2023
Albert Alonso, Julius B. Kirkegaard

GPT Struct Me: Probing GPT Models on Narrative Entity Extraction

Nov 24, 2023
Hugo Sousa, Nuno Guimarães, Alípio Jorge, Ricardo Campos

The Mixtures and the Neural Critics: On the Pointwise Mutual Information Profiles of Fine Distributions

Oct 16, 2023
Paweł Czyż, Frederic Grabowski, Julia E. Vogt, Niko Beerenwinkel, Alexander Marx

SenTest: Evaluating Robustness of Sentence Encoders

Nov 29, 2023
Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

C3Net: Compound Conditioned ControlNet for Multimodal Content Generation

Nov 29, 2023
Juntao Zhang, Yuehuai Liu, Yu-Wing Tai, Chi-Keung Tang

Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes

Nov 29, 2023
Chi-Hsi Kung, Shu-Wei Lu, Yi-Hsuan Tsai, Yi-Ting Chen

eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos

Nov 29, 2023
Xuecheng Wu, Heli Sun, Junxiao Xue, Ruofan Zhai, Xiangyan Kong, Jiayu Nie, Liang He

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Human Action Segmentation

Nov 29, 2023
Qi Liu, Xinchen Liu, Kun Liu, Xiaoyan Gu, Wu Liu
