Picture for Haoze Sun

Haoze Sun

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Add code
Apr 09, 2026
Viaarxiv icon

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Add code
Mar 29, 2026
Viaarxiv icon

GIFT: Unlocking Global Optimality in Post-Training via Finite-Temperature Gibbs Initialization

Add code
Jan 14, 2026
Viaarxiv icon

S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models

Add code
May 20, 2025
Figure 1 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 2 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 3 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Figure 4 for S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models
Viaarxiv icon

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Add code
Apr 20, 2025
Figure 1 for Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Figure 2 for Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Figure 3 for Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Figure 4 for Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Viaarxiv icon

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Add code
Mar 27, 2025
Figure 1 for ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Figure 2 for ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Figure 3 for ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Figure 4 for ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Viaarxiv icon

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Add code
Mar 19, 2025
Viaarxiv icon

Pixel to Gaussian: Ultra-Fast Continuous Super-Resolution with 2D Gaussian Modeling

Add code
Mar 09, 2025
Viaarxiv icon

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Add code
Feb 24, 2025
Figure 1 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 2 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 3 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Figure 4 for Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
Viaarxiv icon

Baichuan-Omni-1.5 Technical Report

Add code
Jan 26, 2025
Figure 1 for Baichuan-Omni-1.5 Technical Report
Figure 2 for Baichuan-Omni-1.5 Technical Report
Figure 3 for Baichuan-Omni-1.5 Technical Report
Figure 4 for Baichuan-Omni-1.5 Technical Report
Viaarxiv icon