Picture for Fei Huang

Fei Huang

additional authors not shown

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

Add code
Sep 05, 2024
Figure 1 for mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
Figure 2 for mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
Figure 3 for mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
Figure 4 for mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
Viaarxiv icon

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

Add code
Aug 27, 2024
Figure 1 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 2 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 3 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 4 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Viaarxiv icon

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model

Add code
Aug 26, 2024
Viaarxiv icon

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

Add code
Aug 20, 2024
Figure 1 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 2 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 3 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 4 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Viaarxiv icon

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Add code
Aug 09, 2024
Figure 1 for mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
Figure 2 for mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
Figure 3 for mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
Figure 4 for mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
Viaarxiv icon

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement

Add code
Aug 06, 2024
Figure 1 for Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Figure 2 for Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Figure 3 for Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Figure 4 for Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Viaarxiv icon

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Add code
Jul 29, 2024
Figure 1 for mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Figure 2 for mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Figure 3 for mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Figure 4 for mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Viaarxiv icon

Knowledge Mechanisms in Large Language Models: A Survey and Perspective

Add code
Jul 22, 2024
Figure 1 for Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Figure 2 for Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Figure 3 for Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Figure 4 for Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Viaarxiv icon

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Add code
Jul 21, 2024
Figure 1 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 2 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 3 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 4 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Viaarxiv icon

Visual Text Generation in the Wild

Add code
Jul 19, 2024
Viaarxiv icon