Alert button

"Text": models, code, and papers
Alert button

Beyond Accuracy: Automated De-Identification of Large Real-World Clinical Text Datasets

Dec 13, 2023
Veysel Kocaman, Hasham Ul Haq, David Talby

Viaarxiv icon

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Dec 13, 2023
Liangchen Song, Liangliang Cao, Jiatao Gu, Yifan Jiang, Junsong Yuan, Hao Tang

Figure 1 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 2 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 3 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 4 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Viaarxiv icon

The Problem of Alignment

Dec 30, 2023
Tsvetelina Hristova, Liam Magee, Karen Soldatic

Viaarxiv icon

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Dec 28, 2023
Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi

Viaarxiv icon

ZONE: Zero-Shot Instruction-Guided Local Editing

Dec 28, 2023
Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xuhui Liu, Jiaming Liu, Li Lin, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang

Viaarxiv icon

SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM

Jan 02, 2024
Weijin Cheng, Jianzhi Liu, Jiawen Deng, Fuji Ren

Viaarxiv icon

TextAug: Test time Text Augmentation for Multimodal Person Re-identification

Dec 04, 2023
Mulham Fawakherji, Eduard Vazquez, Pasquale Giampa, Binod Bhattarai

Figure 1 for TextAug: Test time Text Augmentation for Multimodal Person Re-identification
Figure 2 for TextAug: Test time Text Augmentation for Multimodal Person Re-identification
Figure 3 for TextAug: Test time Text Augmentation for Multimodal Person Re-identification
Figure 4 for TextAug: Test time Text Augmentation for Multimodal Person Re-identification
Viaarxiv icon

MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing

Nov 28, 2023
Rui Yang, Qingcheng Zeng, Keen You, Yujie Qiao, Lucas Huang, Chia-Chun Hsieh, Benjamin Rosand, Jeremy Goldwasser, Amisha D Dave, Tiarnan D. L. Keenan, Emily Y Chew, Dragomir Radev, Zhiyong Lu, Hua Xu, Qingyu Chen, Irene Li

Viaarxiv icon

Generalist embedding models are better at short-context clinical semantic search than specialized embedding models

Jan 03, 2024
Jean-Baptiste Excoffier, Tom Roehr, Alexei Figueroa, Michalis Papaaioannou, Keno Bressem, Matthieu Ortala

Viaarxiv icon

InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

Dec 10, 2023
Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

Viaarxiv icon