Picture for Siqian Tong

Siqian Tong

PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding

Add code
Feb 24, 2026
Viaarxiv icon

AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning

Add code
Feb 14, 2026
Viaarxiv icon

Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning

Add code
Nov 18, 2025
Figure 1 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 2 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 3 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 4 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Viaarxiv icon