Zhenfang Chen

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Nov 06, 2023
Via arXiv

Sparse Universal Transformer

Oct 11, 2023
Via arXiv

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions

Oct 10, 2023
Via arXiv

SALMON: Self-Alignment with Principle-Following Reward Models

Oct 09, 2023
Via arXiv

3D-LLM: Injecting the 3D World into Large Language Models

Jul 24, 2023
Via arXiv

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties

Jun 27, 2023
Via arXiv

ModuleFormer: Learning Modular Large Language Models From Uncurated Data

Jun 07, 2023
Via arXiv

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

May 04, 2023
Via arXiv

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

Apr 07, 2023
Via arXiv

Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention

Apr 06, 2023
Via arXiv