Alert button

"Text": models, code, and papers
Alert button

FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion

Mar 06, 2024
Hao Wang, Sayed Pedram Haeri Boroujeni, Xiwen Chen, Ashish Bastola, Huayu Li, Abolfazl Razi

Viaarxiv icon

AtomoVideo: High Fidelity Image-to-Video Generation

Mar 05, 2024
Litong Gong, Yiran Zhu, Weijie Li, Xiaoyang Kang, Biao Wang, Tiezheng Ge, Bo Zheng

Figure 1 for AtomoVideo: High Fidelity Image-to-Video Generation
Figure 2 for AtomoVideo: High Fidelity Image-to-Video Generation
Figure 3 for AtomoVideo: High Fidelity Image-to-Video Generation
Figure 4 for AtomoVideo: High Fidelity Image-to-Video Generation
Viaarxiv icon

PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus

Mar 01, 2024
Deborah N. Jakobi, Thomas Kern, David R. Reich, Patrick Haller, Lena A. Jäger

Figure 1 for PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
Figure 2 for PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
Figure 3 for PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
Figure 4 for PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
Viaarxiv icon

Effectiveness Assessment of Recent Large Vision-Language Models

Mar 07, 2024
Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan

Figure 1 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 2 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 3 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 4 for Effectiveness Assessment of Recent Large Vision-Language Models
Viaarxiv icon

Retrieval is Accurate Generation

Feb 29, 2024
Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi

Viaarxiv icon

JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models

Mar 05, 2024
Arefa, Mohammed Abbas Ansari, Chandni Saxena, Tanvir Ahmad

Figure 1 for JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
Figure 2 for JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
Figure 3 for JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
Figure 4 for JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
Viaarxiv icon

Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad

Mar 05, 2024
Sayantan Choudhury, Nazarii Tupitsa, Nicolas Loizou, Samuel Horvath, Martin Takac, Eduard Gorbunov

Viaarxiv icon

Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

Mar 02, 2024
Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipman

Viaarxiv icon

RouteExplainer: An Explanation Framework for Vehicle Routing Problem

Mar 06, 2024
Daisuke Kikuta, Hiroki Ikeuchi, Kengo Tajiri, Yuusuke Nakano

Figure 1 for RouteExplainer: An Explanation Framework for Vehicle Routing Problem
Figure 2 for RouteExplainer: An Explanation Framework for Vehicle Routing Problem
Figure 3 for RouteExplainer: An Explanation Framework for Vehicle Routing Problem
Figure 4 for RouteExplainer: An Explanation Framework for Vehicle Routing Problem
Viaarxiv icon

ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing

Feb 17, 2024
Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu

Viaarxiv icon