Picture for Bin Xu

Bin Xu

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Add code
Feb 06, 2024
Figure 1 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 2 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 3 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 4 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Viaarxiv icon

CogAgent: A Visual Language Model for GUI Agents

Add code
Dec 21, 2023
Figure 1 for CogAgent: A Visual Language Model for GUI Agents
Figure 2 for CogAgent: A Visual Language Model for GUI Agents
Figure 3 for CogAgent: A Visual Language Model for GUI Agents
Figure 4 for CogAgent: A Visual Language Model for GUI Agents
Viaarxiv icon

When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks

Add code
Nov 15, 2023
Viaarxiv icon

CogVLM: Visual Expert for Pretrained Language Models

Add code
Nov 06, 2023
Viaarxiv icon

Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping

Add code
Oct 26, 2023
Figure 1 for Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
Figure 2 for Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
Figure 3 for Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
Figure 4 for Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
Viaarxiv icon

Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment

Add code
Oct 16, 2023
Viaarxiv icon

BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation

Add code
Oct 16, 2023
Viaarxiv icon

ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation

Add code
Aug 31, 2023
Viaarxiv icon

Towards General Low-Light Raw Noise Synthesis and Modeling

Add code
Aug 17, 2023
Figure 1 for Towards General Low-Light Raw Noise Synthesis and Modeling
Figure 2 for Towards General Low-Light Raw Noise Synthesis and Modeling
Figure 3 for Towards General Low-Light Raw Noise Synthesis and Modeling
Figure 4 for Towards General Low-Light Raw Noise Synthesis and Modeling
Viaarxiv icon

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Add code
Jun 15, 2023
Figure 1 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 2 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 3 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 4 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Viaarxiv icon