Alert button

"Information": models, code, and papers
Alert button

Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning

Dec 10, 2022
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng

Figure 1 for Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Figure 2 for Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Figure 3 for Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Figure 4 for Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Viaarxiv icon

Progressive Multi-view Human Mesh Recovery with Self-Supervision

Dec 10, 2022
Xuan Gong, Liangchen Song, Meng Zheng, Benjamin Planche, Terrence Chen, Junsong Yuan, David Doermann, Ziyan Wu

Figure 1 for Progressive Multi-view Human Mesh Recovery with Self-Supervision
Figure 2 for Progressive Multi-view Human Mesh Recovery with Self-Supervision
Figure 3 for Progressive Multi-view Human Mesh Recovery with Self-Supervision
Figure 4 for Progressive Multi-view Human Mesh Recovery with Self-Supervision
Viaarxiv icon

GCT: Gated Contextual Transformer for Sequential Audio Tagging

Oct 22, 2022
Yuanbo Hou, Yun Wang, Wenwu Wang, Dick Botteldooren

Figure 1 for GCT: Gated Contextual Transformer for Sequential Audio Tagging
Figure 2 for GCT: Gated Contextual Transformer for Sequential Audio Tagging
Figure 3 for GCT: Gated Contextual Transformer for Sequential Audio Tagging
Figure 4 for GCT: Gated Contextual Transformer for Sequential Audio Tagging
Viaarxiv icon

FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering

Nov 18, 2022
Akhil Kedia, Mohd Abbas Zaidi, Haejun Lee

Figure 1 for FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering
Figure 2 for FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering
Figure 3 for FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering
Figure 4 for FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering
Viaarxiv icon

Controlling Bias Exposure for Fair Interpretable Predictions

Oct 14, 2022
Zexue He, Yu Wang, Julian McAuley, Bodhisattwa Prasad Majumder

Figure 1 for Controlling Bias Exposure for Fair Interpretable Predictions
Figure 2 for Controlling Bias Exposure for Fair Interpretable Predictions
Figure 3 for Controlling Bias Exposure for Fair Interpretable Predictions
Figure 4 for Controlling Bias Exposure for Fair Interpretable Predictions
Viaarxiv icon

Unimodal and Multimodal Representation Training for Relation Extraction

Nov 11, 2022
Ciaran Cooney, Rachel Heyburn, Liam Maddigan, Mairead O'Cuinn, Chloe Thompson, Joana Cavadas

Figure 1 for Unimodal and Multimodal Representation Training for Relation Extraction
Figure 2 for Unimodal and Multimodal Representation Training for Relation Extraction
Figure 3 for Unimodal and Multimodal Representation Training for Relation Extraction
Figure 4 for Unimodal and Multimodal Representation Training for Relation Extraction
Viaarxiv icon

Using dynamic circles and squares to visualize spatio-temporal variation

Nov 11, 2022
Harsh Patel, Nicole Schneider, Hanan Samet

Figure 1 for Using dynamic circles and squares to visualize spatio-temporal variation
Figure 2 for Using dynamic circles and squares to visualize spatio-temporal variation
Figure 3 for Using dynamic circles and squares to visualize spatio-temporal variation
Figure 4 for Using dynamic circles and squares to visualize spatio-temporal variation
Viaarxiv icon

Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2022): Workshop and Shared Task Report

Nov 21, 2022
Ali Hürriyetoğlu, Hristo Tanev, Vanni Zavarella, Reyyan Yeniterzi, Osman Mutlu, Erdem Yörük

Viaarxiv icon

Reinforced Language Modeling for End-to-End Task Oriented Dialog

Nov 30, 2022
Xiao Yu, Qingyang Wu, Kun Qian, Zhou Yu

Figure 1 for Reinforced Language Modeling for End-to-End Task Oriented Dialog
Figure 2 for Reinforced Language Modeling for End-to-End Task Oriented Dialog
Figure 3 for Reinforced Language Modeling for End-to-End Task Oriented Dialog
Figure 4 for Reinforced Language Modeling for End-to-End Task Oriented Dialog
Viaarxiv icon

A perspective on the use of health digital twins in computational pathology

Nov 30, 2022
Manuel Cossio

Figure 1 for A perspective on the use of health digital twins in computational pathology
Figure 2 for A perspective on the use of health digital twins in computational pathology
Figure 3 for A perspective on the use of health digital twins in computational pathology
Viaarxiv icon