Picture for Yitao Liang

Yitao Liang

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Add code
Jun 27, 2024
Viaarxiv icon

CLoG: Benchmarking Continual Learning of Image Generation Models

Add code
Jun 07, 2024
Figure 1 for CLoG: Benchmarking Continual Learning of Image Generation Models
Figure 2 for CLoG: Benchmarking Continual Learning of Image Generation Models
Figure 3 for CLoG: Benchmarking Continual Learning of Image Generation Models
Figure 4 for CLoG: Benchmarking Continual Learning of Image Generation Models
Viaarxiv icon

Semantic Loss Functions for Neuro-Symbolic Structured Prediction

Add code
May 12, 2024
Figure 1 for Semantic Loss Functions for Neuro-Symbolic Structured Prediction
Figure 2 for Semantic Loss Functions for Neuro-Symbolic Structured Prediction
Figure 3 for Semantic Loss Functions for Neuro-Symbolic Structured Prediction
Figure 4 for Semantic Loss Functions for Neuro-Symbolic Structured Prediction
Viaarxiv icon

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Add code
Mar 08, 2024
Figure 1 for RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Figure 2 for RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Figure 3 for RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Figure 4 for RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Viaarxiv icon

DIGIC: Domain Generalizable Imitation Learning by Causal Discovery

Add code
Feb 29, 2024
Viaarxiv icon

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

Add code
Feb 04, 2024
Figure 1 for Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Figure 2 for Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Figure 3 for Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Figure 4 for Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Viaarxiv icon

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Add code
Nov 30, 2023
Viaarxiv icon

Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective

Add code
Oct 31, 2023
Figure 1 for Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective
Figure 2 for Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective
Figure 3 for Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective
Figure 4 for Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective
Viaarxiv icon

MCU: A Task-centric Framework for Open-ended Agent Evaluation in Minecraft

Add code
Oct 12, 2023
Viaarxiv icon

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

Add code
Oct 12, 2023
Viaarxiv icon