Alert button

"Text": models, code, and papers
Alert button

PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Nov 29, 2023
Zuoyan Zhao, Shipeng Zhu, Pengfei Fang, Hui Xue

Viaarxiv icon

Benchmarking PathCLIP for Pathology Image Analysis

Jan 05, 2024
Sunyi Zheng, Xiaonan Cui, Yuxuan Sun, Jingxiong Li, Honglin Li, Yunlong Zhang, Pingyi Chen, Xueping Jing, Zhaoxiang Ye, Lin Yang

Viaarxiv icon

InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following

Dec 30, 2023
Shufan Li, Harkanwar Singh, Aditya Grover

Viaarxiv icon

Building Efficient Universal Classifiers with Natural Language Inference

Dec 29, 2023
Moritz Laurer, Wouter van Atteveldt, Andreu Casas, Kasper Welbers

Viaarxiv icon

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Dec 23, 2023
Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko

Viaarxiv icon

Preserving Image Properties Through Initializations in Diffusion Models

Jan 04, 2024
Jeffrey Zhang, Shao-Yu Chang, Kedan Li, David Forsyth

Viaarxiv icon

Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training

Jan 02, 2024
Jiuming Qin, Che Liu, Sibo Cheng, Yike Guo, Rossella Arcucci

Viaarxiv icon

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

Jan 02, 2024
Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

Viaarxiv icon

Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning

Dec 12, 2023
Lifeng Han, Serge Gladkoff, Gleb Erofeev, Irina Sorokina, Betty Galiano, Goran Nenadic

Viaarxiv icon

Beyond Accuracy: Automated De-Identification of Large Real-World Clinical Text Datasets

Dec 13, 2023
Veysel Kocaman, Hasham Ul Haq, David Talby

Viaarxiv icon