Alert button

"Text": models, code, and papers
Alert button

LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions

Nov 20, 2023
Songhao Han, Le Zhuo, Yue Liao, Si Liu

Viaarxiv icon

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems

Nov 20, 2023
Guangjing Wang, Ce Zhou, Yuanda Wang, Bocheng Chen, Hanqing Guo, Qiben Yan

Viaarxiv icon

FLAP: Fast Language-Audio Pre-training

Nov 02, 2023
Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Gosh

Viaarxiv icon

DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation

Oct 19, 2023
Bangbang Yang, Wenqi Dong, Lin Ma, Wenbo Hu, Xiao Liu, Zhaopeng Cui, Yuewen Ma

Viaarxiv icon

A Simple Text to Video Model via Transformer

Sep 26, 2023
Gang Chen

Viaarxiv icon

CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model

Nov 10, 2023
Yang Lei, Jiangtong Li, Ming Jiang, Junjie Hu, Dawei Cheng, Zhijun Ding, Changjun Jiang

Figure 1 for CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model
Figure 2 for CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model
Figure 3 for CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model
Figure 4 for CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model
Viaarxiv icon

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Nov 10, 2023
Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan

Figure 1 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 2 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 3 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 4 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Viaarxiv icon

From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design

Nov 21, 2023
Cyril Picard, Kristen M. Edwards, Anna C. Doris, Brandon Man, Giorgio Giannone, Md Ferdous Alam, Faez Ahmed

Viaarxiv icon

InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists

Sep 30, 2023
Yulu Gan, Sungwoo Park, Alexander Schubert, Anthony Philippakis, Ahmed M. Alaa

Viaarxiv icon

Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection

Nov 16, 2023
Sarah Masud, Mohammad Aflah Khan, Md. Shad Akhtar, Tanmoy Chakraborty

Viaarxiv icon