"Text": models, code, and papers

The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding

Nov 29, 2023
Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi

Via arXiv

The Falcon Series of Open Language Models

Nov 29, 2023
Ebtesam Almazrouei, Hamza Alobeidli, Abdulaziz Alshamsi, Alessandro Cappelli, Ruxandra Cojocaru, Mérouane Debbah, Étienne Goffinet, Daniel Hesslow, Julien Launay, Quentin Malartic, Daniele Mazzotta, Badreddine Noune, Baptiste Pannier, Guilherme Penedo

Via arXiv

Zero-shot audio captioning with audio-language model guidance and audio context keywords

Nov 14, 2023
Leonard Salewski, Stefan Fauth, A. Sophia Koepke, Zeynep Akata

Via arXiv

Tell2Design: A Dataset for Language-Guided Floor Plan Generation

Nov 27, 2023
Sicong Leng, Yang Zhou, Mohammed Haroon Dupty, Wee Sun Lee, Sam Conrad Joyce, Wei Lu

Via arXiv

Reinforcement Learning from Diffusion Feedback: Q* for Image Search

Nov 27, 2023
Aboli Marathe

Via arXiv

Benchmarking Large Language Model Volatility

Nov 26, 2023
Boyang Yu

Via arXiv

mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model

Nov 30, 2023
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang

Via arXiv

Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention

Nov 30, 2023
Lujia Shen, Yuwen Pu, Shouling Ji, Changjiang Li, Xuhong Zhang, Chunpeng Ge, Ting Wang

Via arXiv

HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Nov 28, 2023
Jingbo Zhang, Xiaoyu Li, Qi Zhang, Yanpei Cao, Ying Shan, Jing Liao

Via arXiv

Pragmatic Radiology Report Generation

Nov 28, 2023
Dang Nguyen, Chacha Chen, He He, Chenhao Tan

Via arXiv