"Text": models, code, and papers

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Jan 20, 2023
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani

Via arXiv

The Case Records of ChatGPT: Language Models and Complex Clinical Questions

May 09, 2023
Timothy Poterucha, Pierre Elias, Christopher M. Haggerty

Via arXiv

DocLangID: Improving Few-Shot Training to Identify the Language of Historical Documents

May 03, 2023
Furkan Simsek, Brian Pfitzmann, Hendrik Raetz, Jona Otholt, Haojin Yang, Christoph Meinel

Via arXiv

AttenWalker: Unsupervised Long-Document Question Answering via Attention-based Graph Walking

May 03, 2023
Yuxiang Nie, Heyan Huang, Wei Wei, Xian-Ling Mao

Via arXiv

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models

Mar 21, 2023
Weijia Wu, Yuzhong Zhao, Mike Zheng Shou, Hong Zhou, Chunhua Shen

Via arXiv

Anytime Generation of Counterfactual Explanations for Text Classification

Nov 01, 2022
Daniel Gilo, Shaul Markovitch

Via arXiv

Automated speech- and text-based classification of neuropsychiatric conditions in a multidiagnostic setting

Jan 13, 2023
Lasse Hansen, Roberta Rocca, Arndis Simonsen, Alberto Parola, Vibeke Bliksted, Nicolai Ladegaard, Dan Bang, Kristian Tylén, Ethan Weed, Søren Dinesen Østergaard, Riccardo Fusaroli

Via arXiv

Learning Human-Human Interactions in Images from Weak Textual Supervision

Apr 27, 2023
Morris Alper, Hadar Averbuch-Elor

Via arXiv

Teaching Large Language Models to Self-Debug

Apr 11, 2023
Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou

Via arXiv

CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Apr 09, 2023
Dingkang Liang, Jiahao Xie, Zhikang Zou, Xiaoqing Ye, Wei Xu, Xiang Bai

Via arXiv