Picture for Chao Qu

Chao Qu

INF Technology

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Add code
Sep 09, 2025
Figure 1 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 2 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 3 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 4 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Viaarxiv icon

Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Add code
Aug 07, 2025
Figure 1 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 2 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 3 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 4 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Viaarxiv icon

Equivariant Spherical Transformer for Efficient Molecular Modeling

Add code
May 29, 2025
Figure 1 for Equivariant Spherical Transformer for Efficient Molecular Modeling
Figure 2 for Equivariant Spherical Transformer for Efficient Molecular Modeling
Figure 3 for Equivariant Spherical Transformer for Efficient Molecular Modeling
Figure 4 for Equivariant Spherical Transformer for Efficient Molecular Modeling
Viaarxiv icon

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Add code
Apr 10, 2025
Viaarxiv icon

Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging

Add code
Mar 05, 2025
Figure 1 for Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging
Figure 2 for Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging
Figure 3 for Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging
Figure 4 for Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging
Viaarxiv icon

AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

Add code
Feb 17, 2025
Figure 1 for AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Figure 2 for AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Figure 3 for AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Figure 4 for AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Viaarxiv icon

Equivariant Masked Position Prediction for Efficient Molecular Representation

Add code
Feb 12, 2025
Figure 1 for Equivariant Masked Position Prediction for Efficient Molecular Representation
Figure 2 for Equivariant Masked Position Prediction for Efficient Molecular Representation
Figure 3 for Equivariant Masked Position Prediction for Efficient Molecular Representation
Figure 4 for Equivariant Masked Position Prediction for Efficient Molecular Representation
Viaarxiv icon

Refine Knowledge of Large Language Models via Adaptive Contrastive Learning

Add code
Feb 11, 2025
Viaarxiv icon

SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

Add code
Jan 26, 2025
Figure 1 for SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain
Figure 2 for SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain
Viaarxiv icon

CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks

Add code
Sep 05, 2024
Figure 1 for CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks
Figure 2 for CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks
Viaarxiv icon