Chen Sun

Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead

Nov 16, 2023

Towards A Unified Neural Architecture for Visual Recognition and Reasoning

Nov 10, 2023

Analyzing Modular Approaches for Visual Question Decomposition

Nov 10, 2023

Emergence of Abstract State Representations in Embodied Sequence Modeling

Nov 07, 2023

Object-centric Video Representation for Long-term Action Anticipation

Oct 31, 2023

Discrete, compositional, and symbolic representations through attractor dynamics

Oct 03, 2023

Delta-AI: Local objectives for amortized inference in sparse graphical models

Oct 03, 2023

Evaluating the Generation Capabilities of Large Chinese Language Models

Aug 11, 2023

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Jul 31, 2023

Does Visual Pretraining Help End-to-End Reasoning?

Jul 17, 2023