Alert button

"Text": models, code, and papers
Alert button

MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy

Nov 15, 2023
Davis Yoshida, Kartik Goyal, Kevin Gimpel

Viaarxiv icon

Meaning Representations from Trajectories in Autoregressive Models

Nov 02, 2023
Tian Yu Liu, Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto

Viaarxiv icon

Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA

Oct 13, 2023
Sheng Zhou, Dan Guo, Jia Li, Xun Yang, Meng Wang

Figure 1 for Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Figure 2 for Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Figure 3 for Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Figure 4 for Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Viaarxiv icon

Progressive Text-to-Image Diffusion with Soft Latent Direction

Sep 18, 2023
YuTeng Ye, Jiale Cai, Hang Zhou, Guanwen Li, Youjia Zhang, Zikai Song, Chenxing Gao, Junqing Yu, Wei Yang

Figure 1 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 2 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 3 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Figure 4 for Progressive Text-to-Image Diffusion with Soft Latent Direction
Viaarxiv icon

Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning

Oct 29, 2023
Junyu Lu, Dixiang Zhang, Xiaojun Wu, Xinyu Gao, Ruyi Gan, Jiaxing Zhang, Yan Song, Pingjian Zhang

Figure 1 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 2 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 3 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Figure 4 for Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning
Viaarxiv icon

Automatic Report Generation for Histopathology images using pre-trained Vision Transformers

Nov 10, 2023
Saurav Sengupta, Donald E. Brown

Viaarxiv icon

SponTTS: modeling and transferring spontaneous style for TTS

Nov 13, 2023
Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie

Figure 1 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 2 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 3 for SponTTS: modeling and transferring spontaneous style for TTS
Figure 4 for SponTTS: modeling and transferring spontaneous style for TTS
Viaarxiv icon

Do large language models and humans have similar behaviors in causal inference with script knowledge?

Nov 13, 2023
Xudong Hong, Margarita Ryzhova, Daniel Adrian Biondi, Vera Demberg

Viaarxiv icon

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Oct 25, 2023
Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi

Figure 1 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 2 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 3 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 4 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Viaarxiv icon

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Nov 14, 2023
Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

Viaarxiv icon