Alert button

"Text": models, code, and papers
Alert button

PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns

Dec 07, 2023
Shuliang Ning, Duomin Wang, Yipeng Qin, Zirong Jin, Baoyuan Wang, Xiaoguang Han

Figure 1 for PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Figure 2 for PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Figure 3 for PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Figure 4 for PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Viaarxiv icon

Detection and Analysis of Stress-Related Posts in Reddit Acamedic Communities

Dec 02, 2023
Nazzere Oryngozha, Pakizar Shamoi, Ayan Igali

Viaarxiv icon

Localizing and Editing Knowledge in Text-to-Image Generative Models

Oct 20, 2023
Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha

Viaarxiv icon

On the Adversarial Robustness of Graph Contrastive Learning Methods

Nov 29, 2023
Filippo Guerranti, Zinuo Yi, Anna Starovoit, Rafiq Kamel, Simon Geisler, Stephan Günnemann

Figure 1 for On the Adversarial Robustness of Graph Contrastive Learning Methods
Figure 2 for On the Adversarial Robustness of Graph Contrastive Learning Methods
Figure 3 for On the Adversarial Robustness of Graph Contrastive Learning Methods
Figure 4 for On the Adversarial Robustness of Graph Contrastive Learning Methods
Viaarxiv icon

Plug-and-Play, Dense-Label-Free Extraction of Open-Vocabulary Semantic Segmentation from Vision-Language Models

Nov 28, 2023
Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal, Boyang Li

Viaarxiv icon

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Nov 28, 2023
Yijun Yang, Tianyi Zhou, Kanxue Li, Dapeng Tao, Lusong Li, Li Shen, Xiaodong He, Jing Jiang, Yuhui Shi

Figure 1 for Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Figure 2 for Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Figure 3 for Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Figure 4 for Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Viaarxiv icon

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

Nov 03, 2023
Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

Figure 1 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 2 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 3 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 4 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Viaarxiv icon

Improving Activation Steering in Language Models with Mean-Centring

Dec 06, 2023
Ole Jorgensen, Dylan Cope, Nandi Schoots, Murray Shanahan

Viaarxiv icon

LaCour!: Enabling Research on Argumentation in Hearings of the European Court of Human Rights

Dec 08, 2023
Lena Held, Ivan Habernal

Viaarxiv icon

Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects

Dec 08, 2023
Junyu Lu, Ruyi Gan, Dixiang Zhang, Xiaojun Wu, Ziwei Wu, Renliang Sun, Jiaxing Zhang, Pingjian Zhang, Yan Song

Figure 1 for Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Figure 2 for Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Figure 3 for Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Figure 4 for Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Viaarxiv icon