Alert button

"Text": models, code, and papers
Alert button

A Comparative Study of Pretrained Language Models for Long Clinical Text

Jan 27, 2023
Yikuan Li, Ramsey M. Wehbe, Faraz S. Ahmad, Hanyin Wang, Yuan Luo

Figure 1 for A Comparative Study of Pretrained Language Models for Long Clinical Text
Figure 2 for A Comparative Study of Pretrained Language Models for Long Clinical Text
Figure 3 for A Comparative Study of Pretrained Language Models for Long Clinical Text
Figure 4 for A Comparative Study of Pretrained Language Models for Long Clinical Text
Viaarxiv icon

JOIST: A Joint Speech and Text Streaming Model For ASR

Oct 13, 2022
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman

Figure 1 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 2 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 3 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 4 for JOIST: A Joint Speech and Text Streaming Model For ASR
Viaarxiv icon

OPI at SemEval 2023 Task 1: Image-Text Embeddings and Multimodal Information Retrieval for Visual Word Sense Disambiguation

Apr 14, 2023
Sławomir Dadas

Figure 1 for OPI at SemEval 2023 Task 1: Image-Text Embeddings and Multimodal Information Retrieval for Visual Word Sense Disambiguation
Figure 2 for OPI at SemEval 2023 Task 1: Image-Text Embeddings and Multimodal Information Retrieval for Visual Word Sense Disambiguation
Figure 3 for OPI at SemEval 2023 Task 1: Image-Text Embeddings and Multimodal Information Retrieval for Visual Word Sense Disambiguation
Figure 4 for OPI at SemEval 2023 Task 1: Image-Text Embeddings and Multimodal Information Retrieval for Visual Word Sense Disambiguation
Viaarxiv icon

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Dec 16, 2022
Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlicek, Alexei V. Ivanov, Aravind Ganapathiraju

Figure 1 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 2 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 3 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 4 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Viaarxiv icon

VideoXum: Cross-modal Visual and Textural Summarization of Videos

Mar 21, 2023
Jingyang Lin, Hang Hua, Ming Chen, Yikang Li, Jenhao Hsiao, Chiuman Ho, Jiebo Luo

Figure 1 for VideoXum: Cross-modal Visual and Textural Summarization of Videos
Figure 2 for VideoXum: Cross-modal Visual and Textural Summarization of Videos
Figure 3 for VideoXum: Cross-modal Visual and Textural Summarization of Videos
Figure 4 for VideoXum: Cross-modal Visual and Textural Summarization of Videos
Viaarxiv icon

JPEG Compressed Images Can Bypass Protections Against AI Editing

Apr 07, 2023
Pedro Sandoval-Segura, Jonas Geiping, Tom Goldstein

Figure 1 for JPEG Compressed Images Can Bypass Protections Against AI Editing
Figure 2 for JPEG Compressed Images Can Bypass Protections Against AI Editing
Figure 3 for JPEG Compressed Images Can Bypass Protections Against AI Editing
Figure 4 for JPEG Compressed Images Can Bypass Protections Against AI Editing
Viaarxiv icon

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models

Apr 05, 2023
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao

Figure 1 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 2 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 3 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 4 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Viaarxiv icon

Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation

Dec 20, 2022
Xinyu Pi, Bing Wang, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou

Figure 1 for Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
Figure 2 for Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
Figure 3 for Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
Figure 4 for Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
Viaarxiv icon

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

Apr 01, 2023
Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Figure 1 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 2 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 3 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 4 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Viaarxiv icon

Local Contrastive Learning for Medical Image Recognition

Mar 24, 2023
S. A. Rizvi, R. Tang, X. Jiang, X. Ma, X. Hu

Figure 1 for Local Contrastive Learning for Medical Image Recognition
Figure 2 for Local Contrastive Learning for Medical Image Recognition
Figure 3 for Local Contrastive Learning for Medical Image Recognition
Figure 4 for Local Contrastive Learning for Medical Image Recognition
Viaarxiv icon