Picture for Bo Li

Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Add code
Aug 18, 2025
Viaarxiv icon

IOCC: Aligning Semantic and Cluster Centers for Few-shot Short Text Clustering

Add code
Aug 08, 2025
Viaarxiv icon

Adaptive Backtracking for Privacy Protection in Large Language Models

Add code
Aug 08, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

MMSearch-R1: Incentivizing LMMs to Search

Add code
Jun 25, 2025
Viaarxiv icon

RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories

Add code
Jun 18, 2025
Viaarxiv icon

Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models

Add code
Jun 16, 2025
Viaarxiv icon

Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs

Add code
Jun 12, 2025
Viaarxiv icon

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Add code
Jun 10, 2025
Viaarxiv icon

HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions

Add code
May 29, 2025
Viaarxiv icon