Adam Yala

Describe Anything: Detailed Localized Image and Video Captioning

Apr 22, 2025

Learning Adaptive Parallel Reasoning with Language Models

Apr 21, 2025

TULIP: Towards Unified Language-Image Pretraining

Mar 19, 2025

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Mar 16, 2025

Rethinking Patch Dependence for Masked Autoencoders

Jan 25, 2024

LLM-grounded Video Diffusion Models

Oct 02, 2023

Conformal Language Modeling

Jun 16, 2023

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

May 23, 2023

PEOPL: Characterizing Privately Encoded Open Datasets with Public Labels

Mar 31, 2023

AI Gone Astray: Technical Supplement

Mar 01, 2022