Picture for Zeen Song

Zeen Song

From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

Add code
Mar 03, 2026
Viaarxiv icon

Closing the Loop: A Control-Theoretic Framework for Provably Stable Time Series Forecasting with LLMs

Add code
Feb 13, 2026
Viaarxiv icon

Adaptive Uncertainty-Aware Tree Search for Robust Reasoning

Add code
Feb 06, 2026
Viaarxiv icon

Causal Front-Door Adjustment for Robust Jailbreak Attacks on LLMs

Add code
Feb 05, 2026
Viaarxiv icon

Group Causal Policy Optimization for Post-Training Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction

Add code
Aug 06, 2025
Viaarxiv icon

Reward Model Generalization for Compute-Aware Test-Time Reasoning

Add code
May 23, 2025
Viaarxiv icon

Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs

Add code
May 15, 2025
Viaarxiv icon

On the Generalization and Causal Explanation in Self-Supervised Learning

Add code
Oct 01, 2024
Viaarxiv icon

On the Discriminability of Self-Supervised Representation Learning

Add code
Jul 18, 2024
Viaarxiv icon