Tao Zhang

Ordered Genetic Algorithm for Entrance Dependent Vehicle Routing Problem in Farms

Feb 26, 2025
Via arXiv

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Feb 24, 2025
Via arXiv

From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE Control

Feb 04, 2025
Via arXiv

T-SCEND: Test-time Scalable MCTS-enhanced Diffusion Model

Feb 04, 2025
Via arXiv

UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation

Feb 04, 2025
Via arXiv

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Feb 03, 2025
Via arXiv

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025
Via arXiv

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Jan 26, 2025
Via arXiv

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Jan 08, 2025
Via arXiv

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Jan 07, 2025
Via arXiv