"Text": models, code, and papers

Efficient Pre-training for Localized Instruction Generation of Videos

Nov 27, 2023
Anil Batra, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller

DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination

Nov 27, 2023
Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Magicoder: Source Code Is All You Need

Dec 04, 2023
Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang

Measuring Information in Text Explanations

Oct 06, 2023
Zining Zhu, Frank Rudzicz

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Nov 17, 2023
Sai Saketh Rambhatla, Ishan Misra

Manipulating the Label Space for In-Context Classification

Dec 06, 2023
Haokun Chen, Xu Yang, Yuhang Huang, Zihan Wu, Jing Wang, Xin Geng

LooseControl: Lifting ControlNet for Generalized Depth Conditioning

Dec 05, 2023
Shariq Farooq Bhat, Niloy J. Mitra, Peter Wonka

COLE: A Hierarchical Generation Framework for Graphic Design

Nov 28, 2023
Peidong Jia, Chenxuan Li, Zeyu Liu, Yichao Shen, Xingru Chen, Yuhui Yuan, Yinglin Zheng, Dong Chen, Ji Li, Xiaodong Xie, Shanghang Zhang, Baining Guo

SEED-Bench-2: Benchmarking Multimodal Large Language Models

Nov 28, 2023
Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching

Nov 21, 2023
Meng Chu, Zhedong Zheng, Wei Ji, Tat-Seng Chua
