Picture for Akash Gokul

Akash Gokul

MobileWorldBench: Towards Semantic World Modeling For Mobile Agents

Add code
Dec 16, 2025
Viaarxiv icon

Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement

Add code
Nov 08, 2025
Figure 1 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 2 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 3 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 4 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Viaarxiv icon

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Add code
May 22, 2025
Viaarxiv icon

xGen-small Technical Report

Add code
May 10, 2025
Figure 1 for xGen-small Technical Report
Figure 2 for xGen-small Technical Report
Figure 3 for xGen-small Technical Report
Figure 4 for xGen-small Technical Report
Viaarxiv icon

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Add code
Mar 15, 2025
Viaarxiv icon

OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Add code
Dec 02, 2024
Figure 1 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 2 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 3 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 4 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Viaarxiv icon

Aligning Diffusion Models by Optimizing Human Utility

Add code
Apr 06, 2024
Figure 1 for Aligning Diffusion Models by Optimizing Human Utility
Figure 2 for Aligning Diffusion Models by Optimizing Human Utility
Figure 3 for Aligning Diffusion Models by Optimizing Human Utility
Figure 4 for Aligning Diffusion Models by Optimizing Human Utility
Viaarxiv icon

BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models

Add code
Jan 25, 2024
Figure 1 for BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Figure 2 for BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Figure 3 for BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Figure 4 for BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Viaarxiv icon

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Add code
Oct 13, 2023
Viaarxiv icon

REX: Rapid Exploration and eXploitation for AI Agents

Add code
Jul 18, 2023
Figure 1 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 2 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 3 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 4 for REX: Rapid Exploration and eXploitation for AI Agents
Viaarxiv icon