Picture for Sai Rajeswar

Sai Rajeswar

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Add code
Dec 05, 2024
Figure 1 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 2 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 3 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 4 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Viaarxiv icon

Representing Positional Information in Generative World Models for Object Manipulation

Add code
Sep 19, 2024
Figure 1 for Representing Positional Information in Generative World Models for Object Manipulation
Figure 2 for Representing Positional Information in Generative World Models for Object Manipulation
Figure 3 for Representing Positional Information in Generative World Models for Object Manipulation
Figure 4 for Representing Positional Information in Generative World Models for Object Manipulation
Viaarxiv icon

Multimodal foundation world models for generalist embodied agents

Add code
Jun 26, 2024
Viaarxiv icon

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Add code
Jun 17, 2024
Viaarxiv icon

VCR: Visual Caption Restoration

Add code
Jun 10, 2024
Figure 1 for VCR: Visual Caption Restoration
Figure 2 for VCR: Visual Caption Restoration
Figure 3 for VCR: Visual Caption Restoration
Figure 4 for VCR: Visual Caption Restoration
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

Equivariant Adaptation of Large Pre-Trained Models

Add code
Oct 02, 2023
Figure 1 for Equivariant Adaptation of Large Pre-Trained Models
Figure 2 for Equivariant Adaptation of Large Pre-Trained Models
Figure 3 for Equivariant Adaptation of Large Pre-Trained Models
Figure 4 for Equivariant Adaptation of Large Pre-Trained Models
Viaarxiv icon

Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

Add code
Jul 12, 2023
Figure 1 for Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Figure 2 for Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Figure 3 for Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Figure 4 for Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Viaarxiv icon

Choreographer: Learning and Adapting Skills in Imagination

Add code
Nov 23, 2022
Viaarxiv icon

Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Add code
Sep 24, 2022
Figure 1 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 2 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 3 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 4 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Viaarxiv icon