Alert button

"Text": models, code, and papers
Alert button

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting

Jan 15, 2024
Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin

Viaarxiv icon

MusicRL: Aligning Music Generation to Human Preferences

Feb 06, 2024
Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli

Viaarxiv icon

Anchor-based Large Language Models

Feb 12, 2024
Jianhui Pang, Fanghua Ye, Derek F. Wong, Longyue Wang

Viaarxiv icon

Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

Feb 08, 2024
Shufan Li, Harkanwar Singh, Aditya Grover

Viaarxiv icon

Text2Data: Low-Resource Data Generation with Textual Control

Feb 08, 2024
Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese

Viaarxiv icon

Animated Stickers: Bringing Stickers to Life with Video Diffusion

Feb 08, 2024
David Yan, Winnie Zhang, Luxin Zhang, Anmol Kalia, Dingkang Wang, Ankit Ramchandani, Miao Liu, Albert Pumarola, Edgar Schoenfeld, Elliot Blanchard, Krishna Narni, Yaqiao Luo, Lawrence Chen, Guan Pang, Ali Thabet, Peter Vajda, Amy Bearman, Licheng Yu

Viaarxiv icon

Pixel Sentence Representation Learning

Feb 13, 2024
Chenghao Xiao, Zhuoxu Huang, Danlu Chen, G Thomas Hudson, Yizhi Li, Haoran Duan, Chenghua Lin, Jie Fu, Jungong Han, Noura Al Moubayed

Viaarxiv icon

JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models

Feb 13, 2024
Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaid Harchaoui, Yejin Choi

Viaarxiv icon

GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding

Feb 09, 2024
Stefan Dernbach, Khushbu Agarwal, Alejandro Zuniga, Michael Henry, Sutanay Choudhury

Viaarxiv icon

Large Language Models for Captioning and Retrieving Remote Sensing Images

Feb 09, 2024
João Daniel Silva, João Magalhães, Devis Tuia, Bruno Martins

Viaarxiv icon