Yin Cui

DuoGen: Towards General Purpose Interleaved Multimodal Generation

Feb 03, 2026
Via arXiv

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

May 23, 2025
Via arXiv

Describe Anything: Detailed Localized Image and Video Captioning

Apr 22, 2025
Via arXiv

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Mar 18, 2025
Via arXiv

Cosmos World Foundation Model Platform for Physical AI

Jan 07, 2025
Via arXiv

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Nov 11, 2024
Via arXiv

Edify 3D: Scalable High-Quality 3D Asset Generation

Nov 11, 2024
Via arXiv

Why Fine-grained Labels in Pretraining Benefit Generalization?

Oct 30, 2024
Via arXiv

Wolf: Captioning Everything with a World Summarization Framework

Jul 26, 2024
Via arXiv

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

Apr 30, 2024
Via arXiv