Picture for Sheng Shen

Sheng Shen

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Add code
Jun 17, 2024
Viaarxiv icon

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Add code
May 30, 2024
Viaarxiv icon

DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion

Add code
Mar 31, 2024
Viaarxiv icon

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Add code
Mar 22, 2024
Viaarxiv icon

RAFT: Adapting Language Model to Domain Specific RAG

Add code
Mar 15, 2024
Figure 1 for RAFT: Adapting Language Model to Domain Specific RAG
Figure 2 for RAFT: Adapting Language Model to Domain Specific RAG
Figure 3 for RAFT: Adapting Language Model to Domain Specific RAG
Figure 4 for RAFT: Adapting Language Model to Domain Specific RAG
Viaarxiv icon

Reinforcement Unlearning

Add code
Dec 26, 2023
Viaarxiv icon

Large Language Models are Visual Reasoning Coordinators

Add code
Oct 23, 2023
Figure 1 for Large Language Models are Visual Reasoning Coordinators
Figure 2 for Large Language Models are Visual Reasoning Coordinators
Figure 3 for Large Language Models are Visual Reasoning Coordinators
Figure 4 for Large Language Models are Visual Reasoning Coordinators
Viaarxiv icon

From Text to Tactic: Evaluating LLMs Playing the Game of Avalon

Add code
Oct 10, 2023
Figure 1 for From Text to Tactic: Evaluating LLMs Playing the Game of Avalon
Figure 2 for From Text to Tactic: Evaluating LLMs Playing the Game of Avalon
Figure 3 for From Text to Tactic: Evaluating LLMs Playing the Game of Avalon
Figure 4 for From Text to Tactic: Evaluating LLMs Playing the Game of Avalon
Viaarxiv icon

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Add code
Oct 03, 2023
Figure 1 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 2 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 3 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 4 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Viaarxiv icon

Aligning Large Multimodal Models with Factually Augmented RLHF

Add code
Sep 25, 2023
Figure 1 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 2 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 3 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 4 for Aligning Large Multimodal Models with Factually Augmented RLHF
Viaarxiv icon