Picture for Kevin Lin

Kevin Lin

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Add code
Mar 26, 2025
Viaarxiv icon

Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising

Add code
Mar 26, 2025
Figure 1 for Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Figure 2 for Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Figure 3 for Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Figure 4 for Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Viaarxiv icon

ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning

Add code
Mar 25, 2025
Viaarxiv icon

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Add code
Mar 18, 2025
Viaarxiv icon

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Add code
Jan 31, 2025
Figure 1 for Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Figure 2 for Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Figure 3 for Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Figure 4 for Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Viaarxiv icon

GenXD: Generating Any 3D and 4D Scenes

Add code
Nov 05, 2024
Figure 1 for GenXD: Generating Any 3D and 4D Scenes
Figure 2 for GenXD: Generating Any 3D and 4D Scenes
Figure 3 for GenXD: Generating Any 3D and 4D Scenes
Figure 4 for GenXD: Generating Any 3D and 4D Scenes
Viaarxiv icon

LiVOS: Light Video Object Segmentation with Gated Linear Matching

Add code
Nov 05, 2024
Figure 1 for LiVOS: Light Video Object Segmentation with Gated Linear Matching
Figure 2 for LiVOS: Light Video Object Segmentation with Gated Linear Matching
Figure 3 for LiVOS: Light Video Object Segmentation with Gated Linear Matching
Figure 4 for LiVOS: Light Video Object Segmentation with Gated Linear Matching
Viaarxiv icon

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

Add code
Oct 31, 2024
Figure 1 for DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning
Figure 2 for DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning
Figure 3 for DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning
Figure 4 for DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning
Viaarxiv icon

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

Add code
Oct 30, 2024
Figure 1 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 2 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 3 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 4 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Viaarxiv icon

Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration

Add code
Oct 17, 2024
Figure 1 for Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Figure 2 for Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Figure 3 for Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Figure 4 for Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Viaarxiv icon