Song-Chun Zhu

University of California, Los Angeles

BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts

Dec 31, 2025

Generative Actor Critic

Dec 25, 2025

TongSIM: A General Platform for Simulating Intelligent Machines

Dec 23, 2025

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Dec 19, 2025

Social World Model-Augmented Mechanism Design Policy Learning

Oct 22, 2025

LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation

Jun 11, 2025

SIV-Bench: A Video Benchmark for Social Interaction Understanding and Reasoning

Jun 05, 2025

Discrete Markov Bridge

May 26, 2025

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

May 21, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

May 19, 2025