"Text": models, code, and papers

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Dec 23, 2023
Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko

Preserving Image Properties Through Initializations in Diffusion Models

Jan 04, 2024
Jeffrey Zhang, Shao-Yu Chang, Kedan Li, David Forsyth

Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning

Dec 12, 2023
Lifeng Han, Serge Gladkoff, Gleb Erofeev, Irina Sorokina, Betty Galiano, Goran Nenadic

Beyond Accuracy: Automated De-Identification of Large Real-World Clinical Text Datasets

Dec 13, 2023
Veysel Kocaman, Hasham Ul Haq, David Talby

Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training

Jan 02, 2024
Jiuming Qin, Che Liu, Sibo Cheng, Yike Guo, Rossella Arcucci

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

Jan 02, 2024
Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Dec 13, 2023
Liangchen Song, Liangliang Cao, Jiatao Gu, Yifan Jiang, Junsong Yuan, Hao Tang

The Problem of Alignment

Dec 30, 2023
Tsvetelina Hristova, Liam Magee, Karen Soldatic

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Dec 28, 2023
Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi

ZONE: Zero-Shot Instruction-Guided Local Editing

Dec 28, 2023
Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xuhui Liu, Jiaming Liu, Li Lin, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang
