Alert button

"Text": models, code, and papers
Alert button

Progressive Text-to-Image Diffusion with Soft Latent Direction

Sep 18, 2023
YuTeng Ye, Jiale Cai, Hang Zhou, Guanwen Li, Youjia Zhang, Zikai Song, Chenxing Gao, Junqing Yu, Wei Yang

Figure 1 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 2 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 3 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 4 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Viaarxiv icon

Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning

Oct 29, 2023
Junyu Lu, Dixiang Zhang, Xiaojun Wu, Xinyu Gao, Ruyi Gan, Jiaxing Zhang, Yan Song, Pingjian Zhang

Figure 1 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 2 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 3 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 4 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Viaarxiv icon

Automatic Report Generation for Histopathology images using pre-trained Vision Transformers

Nov 10, 2023
Saurav Sengupta, Donald E. Brown

Viaarxiv icon

SponTTS: modeling and transferring spontaneous style for TTS

Nov 13, 2023
Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie

Figure 1 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 2 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 3 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 4 for SponTTS: modeling and transferring spontaneous style for TTS
Viaarxiv icon

Do large language models and humans have similar behaviors in causal inference with script knowledge?

Nov 13, 2023
Xudong Hong, Margarita Ryzhova, Daniel Adrian Biondi, Vera Demberg

Viaarxiv icon

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Oct 25, 2023
Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi

Figure 1 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 2 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 3 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 4 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Viaarxiv icon

BERT Lost Patience Won't Be Robust to Adversarial Slowdown

Oct 29, 2023
Zachary Coalson, Gabriel Ritter, Rakesh Bobba, Sanghyun Hong

Viaarxiv icon

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Nov 14, 2023
Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

Viaarxiv icon

Quality-Diversity through AI Feedback

Oct 31, 2023
Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

Viaarxiv icon

What's In My Big Data?

Oct 31, 2023
Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge

Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon