"Text": models, code, and papers

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

Feb 20, 2024
Nailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang

Reasoning before Comparison: LLM-Enhanced Semantic Similarity Metrics for Domain Specialized Text Analysis

Feb 20, 2024
Shaochen Xu, Zihao Wu, Huaqin Zhao, Peng Shu, Zhengliang Liu, Wenxiong Liao, Sheng Li, Andrea Sikora, Tianming Liu, Xiang Li

Sculpting Molecules in 3D: A Flexible Substructure Aware Framework for Text-Oriented Molecular Optimization

Mar 06, 2024
Kaiwei Zhang, Yange Lin, Guangcheng Wu, Yuxiang Ren, Xuecang Zhang, Bo Wang, Xiaoyu Zhang, Weitao Du

Learned Image Compression with Text Quality Enhancement

Feb 13, 2024
Chih-Yu Lai, Dung Tran, Kazuhito Koishida

One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models

Mar 04, 2024
Lin Li, Haoyan Guan, Jianing Qiu, Michael Spratling

How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

Mar 04, 2024
Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model

Mar 08, 2024
Junsu Kim, Yunhoe Ku, Jihyeon Kim, Junuk Cha, Seungryul Baek

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model

Mar 08, 2024
Pengwei Yin, Guanzhong Zeng, Jingjing Wang, Di Xie

How Far Are We from Intelligent Visual Deductive Reasoning?

Mar 08, 2024
Yizhe Zhang, He Bai, Ruixiang Zhang, Jiatao Gu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly

LLMBind: A Unified Modality-Task Integration Framework

Mar 08, 2024
Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Jiaxi Cui, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan
