"Text": models, code, and papers

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Oct 16, 2023
Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, Zhenguo Li

Via arXiv

Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks

Nov 20, 2023
Ling Luo, Jinzhong Ning, Yingwen Zhao, Zhijun Wang, Zeyuan Ding, Peng Chen, Weiru Fu, Qinyu Han, Guangtao Xu, Yunzhi Qiu, Dinghao Pan, Jiru Li, Hao Li, Wenduo Feng, Senbo Tu, Yuqi Liu, Zhihao Yang, Jian Wang, Yuanyuan Sun, Hongfei Lin

Via arXiv

Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis

Nov 28, 2023
Aman Yadav, Abhishek Vichare

Via arXiv

A Unified Framework for Multimodal, Multi-Part Human Motion Synthesis

Nov 28, 2023
Zixiang Zhou, Yu Wan, Baoyuan Wang

Via arXiv

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction

Oct 15, 2023
Xiang Hao, Jibin Wu, Jianwei Yu, Chenglin Xu, Kay Chen Tan

Via arXiv

Enchancing Semi-Supervised Learning for Extractive Summarization with an LLM-based pseudolabeler

Nov 16, 2023
Gaurav Sahu, Olga Vechtomova, Issam H. Laradji

Via arXiv

Efficient In-Context Learning in Vision-Language Models for Egocentric Videos

Nov 29, 2023
Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Joyce Chai

Via arXiv

Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering

Nov 29, 2023
Zeqing Wang, Wentao Wan, Runmeng Chen, Qiqing Lao, Minjie Lang, Keze Wang

Via arXiv

Explaining CLIP's performance disparities on data from blind/low vision users

Nov 29, 2023
Daniela Massiceti, Camilla Longden, Agnieszka Slowik, Samuel Wills, Martin Grayson, Cecily Morrison

Via arXiv

ROBBIE: Robust Bias Evaluation of Large Generative Language Models

Nov 29, 2023
David Esiobu, Xiaoqing Tan, Saghar Hosseini, Megan Ung, Yuchen Zhang, Jude Fernandes, Jane Dwivedi-Yu, Eleonora Presani, Adina Williams, Eric Michael Smith

Via arXiv