Alert button

"Information": models, code, and papers
Alert button

Accurate and Fast Compressed Video Captioning

Sep 22, 2023
Yaojie Shen, Xin Gu, Kai Xu, Heng Fan, Longyin Wen, Libo Zhang

Figure 1 for Accurate and Fast Compressed Video Captioning
Figure 2 for Accurate and Fast Compressed Video Captioning
Figure 3 for Accurate and Fast Compressed Video Captioning
Figure 4 for Accurate and Fast Compressed Video Captioning
Viaarxiv icon

MUTEX: Learning Unified Policies from Multimodal Task Specifications

Sep 25, 2023
Rutav Shah, Roberto Martín-Martín, Yuke Zhu

Figure 1 for MUTEX: Learning Unified Policies from Multimodal Task Specifications
Figure 2 for MUTEX: Learning Unified Policies from Multimodal Task Specifications
Figure 3 for MUTEX: Learning Unified Policies from Multimodal Task Specifications
Figure 4 for MUTEX: Learning Unified Policies from Multimodal Task Specifications
Viaarxiv icon

Learned Contextual LiDAR Informed Visual Search in Unseen Environments

Sep 25, 2023
Ryan Gupta, Kyle Morgenstein, Steven Ortega, Luis Sentis

Viaarxiv icon

Newton Method-based Subspace Support Vector Data Description

Sep 25, 2023
Fahad Sohrab, Firas Laakom, Moncef Gabbouj

Viaarxiv icon

Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement

Sep 19, 2023
Jiahui Pan, Shulin He, Hui Zhang, Xueliang Zhang

Figure 1 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 2 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 3 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 4 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Sep 29, 2023
Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur

Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer

Oct 04, 2023
Hongruixuan Chen, Cuiling Lan, Jian Song, Clifford Broni-Bediako, Junshi Xia, Naoto Yokoya

Figure 1 for Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
Figure 2 for Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
Figure 3 for Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
Figure 4 for Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
Viaarxiv icon

Wavelet-based Topological Loss for Low-Light Image Denoising

Sep 20, 2023
Alexandra Malyugina, Nantheera Anantrasirichai, David Bull

Viaarxiv icon

Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning

Aug 14, 2023
Rui Liu, Yuanyuan Chen, Anran Li, Yi Ding, Han Yu, Cuntai Guan

Figure 1 for Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning
Figure 2 for Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning
Figure 3 for Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning
Figure 4 for Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning
Viaarxiv icon

Mirror Diffusion Models for Constrained and Watermarked Generation

Oct 02, 2023
Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao

Viaarxiv icon