Picture for Chao Song

Chao Song

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Add code
Mar 23, 2026
Viaarxiv icon

VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning

Add code
Mar 03, 2026
Viaarxiv icon

ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning

Add code
Oct 01, 2025
Figure 1 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 2 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 3 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 4 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Viaarxiv icon

Linear Preference Optimization: Decoupled Gradient Control via Absolute Regularization

Add code
Aug 20, 2025
Viaarxiv icon

Noise Analysis and Hierarchical Adaptive Body State Estimator For Biped Robot Walking With ESVC Foot

Add code
Jun 10, 2025
Figure 1 for Noise Analysis and Hierarchical Adaptive Body State Estimator For Biped Robot Walking With ESVC Foot
Figure 2 for Noise Analysis and Hierarchical Adaptive Body State Estimator For Biped Robot Walking With ESVC Foot
Figure 3 for Noise Analysis and Hierarchical Adaptive Body State Estimator For Biped Robot Walking With ESVC Foot
Figure 4 for Noise Analysis and Hierarchical Adaptive Body State Estimator For Biped Robot Walking With ESVC Foot
Viaarxiv icon

Model Analysis And Design Of Ellipse Based Segmented Varying Curved Foot For Biped Robot Walking

Add code
Jun 08, 2025
Viaarxiv icon

Diffusion Models for Molecules: A Survey of Methods and Tasks

Add code
Feb 13, 2025
Figure 1 for Diffusion Models for Molecules: A Survey of Methods and Tasks
Figure 2 for Diffusion Models for Molecules: A Survey of Methods and Tasks
Figure 3 for Diffusion Models for Molecules: A Survey of Methods and Tasks
Figure 4 for Diffusion Models for Molecules: A Survey of Methods and Tasks
Viaarxiv icon

Probing many-body Bell correlation depth with superconducting qubits

Add code
Jun 25, 2024
Figure 1 for Probing many-body Bell correlation depth with superconducting qubits
Figure 2 for Probing many-body Bell correlation depth with superconducting qubits
Figure 3 for Probing many-body Bell correlation depth with superconducting qubits
Viaarxiv icon

A Framework to Implement 1+N Multi-task Fine-tuning Pattern in LLMs Using the CGC-LORA Algorithm

Add code
Jan 22, 2024
Viaarxiv icon

AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking

Add code
Oct 24, 2023
Figure 1 for AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Figure 2 for AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Figure 3 for AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Figure 4 for AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Viaarxiv icon