Alert button

"Text": models, code, and papers
Alert button

Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines

Jul 03, 2023
Ellery Smith, Rahel Paloots, Dimitris Giagkos, Michael Baudis, Kurt Stockinger

Figure 1 for Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines
Figure 2 for Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines
Figure 3 for Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines
Figure 4 for Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines
Viaarxiv icon

Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy

Jun 04, 2023
Xiaoting Li, Lingwei Chen, Dinghao Wu

Figure 1 for Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy
Figure 2 for Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy
Figure 3 for Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy
Figure 4 for Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy
Viaarxiv icon

Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model

Feb 27, 2023
Jaeyoung Huh, Sangjoon Park, Jeong Eun Lee, Jong Chul Ye

Figure 1 for Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model
Figure 2 for Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model
Figure 3 for Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model
Figure 4 for Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model
Viaarxiv icon

Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

Jul 06, 2023
Netta Madvil, Yonatan Bitton, Roy Schwartz

Figure 1 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 2 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 3 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 4 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Viaarxiv icon

Geometric Perception based Efficient Text Recognition

Feb 08, 2023
P. N. Deelaka, D. R. Jayakodi, D. Y. Silva

Figure 1 for Geometric Perception based Efficient Text Recognition
Figure 2 for Geometric Perception based Efficient Text Recognition
Figure 3 for Geometric Perception based Efficient Text Recognition
Figure 4 for Geometric Perception based Efficient Text Recognition
Viaarxiv icon

LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning

Jun 10, 2023
Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa

Figure 1 for LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning
Figure 2 for LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning
Figure 3 for LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning
Figure 4 for LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning
Viaarxiv icon

Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding

Mar 31, 2023
Xiang Zhang, Taoyue Wang, Xiaotian Li, Huiyuan Yang, Lijun Yin

Figure 1 for Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding
Figure 2 for Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding
Figure 3 for Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding
Figure 4 for Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding
Viaarxiv icon

The Art of Embedding Fusion: Optimizing Hate Speech Detection

Jun 26, 2023
Mohammad Aflah Khan, Neemesh Yadav, Mohit Jain, Sanyam Goyal

Figure 1 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 2 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 3 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 4 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Viaarxiv icon

PuMer: Pruning and Merging Tokens for Efficient Vision Language Models

May 27, 2023
Qingqing Cao, Bhargavi Paranjape, Hannaneh Hajishirzi

Figure 1 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 2 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 3 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 4 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Viaarxiv icon

LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading

Jun 05, 2023
Yochai Yemini, Aviv Shamsian, Lior Bracha, Sharon Gannot, Ethan Fetaya

Figure 1 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 2 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 3 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 4 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Viaarxiv icon