Alert button

"Text": models, code, and papers
Alert button

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Feb 12, 2024
Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar

Viaarxiv icon

LLM-Oriented Retrieval Tuner

Mar 04, 2024
Si Sun, Hanqing Zhang, Zhiyuan Liu, Jie Bao, Dawei Song

Figure 1 for LLM-Oriented Retrieval Tuner
Figure 2 for LLM-Oriented Retrieval Tuner
Figure 3 for LLM-Oriented Retrieval Tuner
Figure 4 for LLM-Oriented Retrieval Tuner
Viaarxiv icon

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

Mar 05, 2024
Junwen He, Yifan Wang, Lijun Wang, Huchuan Lu, Jun-Yan He, Jin-Peng Lan, Bin Luo, Xuansong Xie

Figure 1 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 2 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 3 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 4 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Viaarxiv icon

Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition

Mar 05, 2024
Yutian Liu, Wenjun Ke, Jianguo Wei

Viaarxiv icon

Artificial Intelligence Exploring the Patent Field

Mar 06, 2024
Lekang Jiang, Stephan Goetz

Figure 1 for Artificial Intelligence Exploring the Patent Field
Figure 2 for Artificial Intelligence Exploring the Patent Field
Figure 3 for Artificial Intelligence Exploring the Patent Field
Figure 4 for Artificial Intelligence Exploring the Patent Field
Viaarxiv icon

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Mar 02, 2024
Shanchuan Lin, Anran Wang, Xiao Yang

Viaarxiv icon

CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions

Mar 01, 2024
Leane Jourdan, Florian Boudin, Nicolas Hernandez, Richard Dufour

Figure 1 for CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
Figure 2 for CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
Figure 3 for CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
Figure 4 for CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
Viaarxiv icon

Distilling Large Language Models for Text-Attributed Graph Learning

Feb 19, 2024
Bo Pan, Zheng Zhang, Yifei Zhang, Yuntong Hu, Liang Zhao

Viaarxiv icon

Exploring Precision and Recall to assess the quality and diversity of LLMs

Feb 28, 2024
Florian Le Bronnec, Alexandre Verine, Benjamin Negrevergne, Yann Chevaleyre, Alexandre Allauzen

Viaarxiv icon

DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory

Mar 04, 2024
Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu

Figure 1 for DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory
Figure 2 for DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory
Figure 3 for DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory
Figure 4 for DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory
Viaarxiv icon