"Text": models, code, and papers

TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion

Mar 02, 2024
Salaheldin Mohamed

Via arXiv

A Two-Stage Dual-Path Framework for Text Tampering Detection and Recognition

Feb 22, 2024
Guandong Li, Xian Yang, Wenpin Ma

Via arXiv

How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

Mar 01, 2024
Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

Via arXiv

PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset

Mar 06, 2024
Arda Uzunoglu, Abdalfatah Rashid Safa, Gözde Gül Şahin

Via arXiv

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Feb 18, 2024
Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He

Via arXiv

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Mar 05, 2024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

Via arXiv

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

Mar 02, 2024
Shuo Yang, Zirui Shang, Yongqi Wang, Derong Deng, Hongwei Chen, Qiyuan Cheng, Xinxiao Wu

Via arXiv

From Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text Embeddings

Feb 25, 2024
Hao Wang, Hao Li, Minlie Huang, Lei Sha

Via arXiv

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Feb 29, 2024
Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun

Via arXiv

Backtracing: Retrieving the Cause of the Query

Mar 06, 2024
Rose E. Wang, Pawan Wirawarn, Omar Khattab, Noah Goodman, Dorottya Demszky

Via arXiv