Picture for Xu Zhao

Xu Zhao

Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network

Add code
Aug 18, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Add code
Jun 10, 2025
Viaarxiv icon

Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction

Add code
Jun 06, 2025
Viaarxiv icon

DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction

Add code
Apr 10, 2025
Viaarxiv icon

Bayesian Optimization for Controlled Image Editing via LLMs

Add code
Feb 26, 2025
Figure 1 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 2 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 3 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 4 for Bayesian Optimization for Controlled Image Editing via LLMs
Viaarxiv icon

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Add code
Feb 18, 2025
Viaarxiv icon

Systematic Outliers in Large Language Models

Add code
Feb 10, 2025
Figure 1 for Systematic Outliers in Large Language Models
Figure 2 for Systematic Outliers in Large Language Models
Figure 3 for Systematic Outliers in Large Language Models
Figure 4 for Systematic Outliers in Large Language Models
Viaarxiv icon

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

Add code
Dec 20, 2024
Figure 1 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 2 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 3 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 4 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Viaarxiv icon

Monocular Lane Detection Based on Deep Learning: A Survey

Add code
Nov 26, 2024
Figure 1 for Monocular Lane Detection Based on Deep Learning: A Survey
Figure 2 for Monocular Lane Detection Based on Deep Learning: A Survey
Figure 3 for Monocular Lane Detection Based on Deep Learning: A Survey
Figure 4 for Monocular Lane Detection Based on Deep Learning: A Survey
Viaarxiv icon