Alert button

"Information": models, code, and papers
Alert button

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks

Jan 25, 2022
Xin Jin, Ruoyu Feng, Simeng Sun, Runsen Feng, Tianyu He, Zhibo Chen

Figure 1 for Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks
Figure 2 for Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks
Figure 3 for Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks
Figure 4 for Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks
Viaarxiv icon

WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment

Apr 22, 2022
Lin Yao, Jianfei Song, Ruizhuo Xu, Yingfang Yang, Zijian Chen, Yafeng Deng

Figure 1 for WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment
Figure 2 for WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment
Figure 3 for WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment
Figure 4 for WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment
Viaarxiv icon

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism

Feb 07, 2021
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

Figure 1 for Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Figure 2 for Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Figure 3 for Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Figure 4 for Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Viaarxiv icon

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Feb 08, 2022
Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis Georgiou

Figure 1 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 2 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 3 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 4 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Viaarxiv icon

Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records

Jan 25, 2022
Sicen Liu, Xiaolong Wang, Yongshuai Hou, Ge Li, Hui Wang, Hui Xu, Yang Xiang, Buzhou Tang

Figure 1 for Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records
Figure 2 for Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records
Figure 3 for Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records
Figure 4 for Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records
Viaarxiv icon

Large-scale Bilingual Language-Image Contrastive Learning

Add code
Bookmark button
Alert button
Apr 15, 2022
Byungsoo Ko, Geonmo Gu

Figure 1 for Large-scale Bilingual Language-Image Contrastive Learning
Figure 2 for Large-scale Bilingual Language-Image Contrastive Learning
Figure 3 for Large-scale Bilingual Language-Image Contrastive Learning
Figure 4 for Large-scale Bilingual Language-Image Contrastive Learning
Viaarxiv icon

Adaptive graph convolutional networks for weakly supervised anomaly detection in videos

Feb 14, 2022
Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

Figure 1 for Adaptive graph convolutional networks for weakly supervised anomaly detection in videos
Figure 2 for Adaptive graph convolutional networks for weakly supervised anomaly detection in videos
Figure 3 for Adaptive graph convolutional networks for weakly supervised anomaly detection in videos
Figure 4 for Adaptive graph convolutional networks for weakly supervised anomaly detection in videos
Viaarxiv icon

Multiple-environment Self-adaptive Network for Aerial-view Geo-localization

Apr 18, 2022
Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Tat-Seng Chua, Yi Yang, Chenggang Yan

Figure 1 for Multiple-environment Self-adaptive Network for Aerial-view Geo-localization
Figure 2 for Multiple-environment Self-adaptive Network for Aerial-view Geo-localization
Figure 3 for Multiple-environment Self-adaptive Network for Aerial-view Geo-localization
Figure 4 for Multiple-environment Self-adaptive Network for Aerial-view Geo-localization
Viaarxiv icon

Counterfactual Regret Minimization for Anti-jamming Game of Frequency Agile Radar

Feb 21, 2022
Huayue Li, Zhaowei Han, Wenqiang Pu, Liangqi Liu, Kang Li, Bo Jiu

Figure 1 for Counterfactual Regret Minimization for Anti-jamming Game of Frequency Agile Radar
Figure 2 for Counterfactual Regret Minimization for Anti-jamming Game of Frequency Agile Radar
Figure 3 for Counterfactual Regret Minimization for Anti-jamming Game of Frequency Agile Radar
Figure 4 for Counterfactual Regret Minimization for Anti-jamming Game of Frequency Agile Radar
Viaarxiv icon

Building a 3-Player Mahjong AI using Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 25, 2022
Xiangyu Zhao, Sean B. Holden

Figure 1 for Building a 3-Player Mahjong AI using Deep Reinforcement Learning
Figure 2 for Building a 3-Player Mahjong AI using Deep Reinforcement Learning
Figure 3 for Building a 3-Player Mahjong AI using Deep Reinforcement Learning
Figure 4 for Building a 3-Player Mahjong AI using Deep Reinforcement Learning
Viaarxiv icon