Yuning Chai

Agentic Very Long Video Understanding

Jan 26, 2026

Reading Recognition in the Wild

May 30, 2025

Generative Data Mining with Longtail-Guided Diffusion

Feb 04, 2025

DriveGPT: Scaling Autoregressive Behavior Models for Driving

Dec 19, 2024

PROFIT: A Specialized Optimizer for Deep Fine Tuning

Dec 09, 2024

PROFIT: A PROximal FIne Tuning Optimizer for Multi-Task Learning

Dec 02, 2024

Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving

Feb 23, 2024

Making Large Multimodal Models Understand Arbitrary Visual Prompts

Dec 01, 2023

SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors

Sep 11, 2023

NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

Aug 24, 2023