Alert button

"Text": models, code, and papers
Alert button

Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence

Feb 15, 2024
Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

Viaarxiv icon

Tree-Based Hard Attention with Self-Motivation for Large Language Models

Feb 14, 2024
Chenxi Lin, Jiayu Ren, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu

Viaarxiv icon

Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition

Feb 19, 2024
Anna Martin-Boyle, Aahan Tyagi, Marti A. Hearst, Dongyeop Kang

Viaarxiv icon

MunTTS: A Text-to-Speech System for Mundari

Jan 28, 2024
Varun Gumma, Rishav Hada, Aditya Yadavalli, Pamir Gogoi, Ishani Mondal, Vivek Seshadri, Kalika Bali

Viaarxiv icon

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Jan 30, 2024
Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li

Viaarxiv icon

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

Jan 31, 2024
Xingning Dong, Zipeng Feng, Chunluan Zhou, Xuzheng Yu, Ming Yang, Qingpei Guo

Viaarxiv icon

UniGraph: Learning a Cross-Domain Graph Foundation Model From Natural Language

Feb 21, 2024
Yufei He, Bryan Hooi

Viaarxiv icon

InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write

Feb 21, 2024
Blagoj Mitrevski, Arina Rak, Julian Schnitzler, Chengkun Li, Andrii Maksai, Jesse Berent, Claudiu Musat

Viaarxiv icon

Deep adaptive sampling for surrogate modeling without labeled data

Feb 17, 2024
Xili Wang, Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

Viaarxiv icon

A Touch, Vision, and Language Dataset for Multimodal Alignment

Feb 20, 2024
Letian Fu, Gaurav Datta, Huang Huang, William Chung-Ho Panitch, Jaimyn Drake, Joseph Ortiz, Mustafa Mukadam, Mike Lambeta, Roberto Calandra, Ken Goldberg

Viaarxiv icon