Xiaotong Ji

The Model Knows, the Decoder Finds: Future Value Guided Particle Power Sampling

May 04, 2026
Via arXiv

The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

Mar 20, 2026
Via arXiv

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Feb 05, 2026
Via arXiv

Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening

Jan 29, 2026
Via arXiv

KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding

Dec 16, 2025
Via arXiv

Enhancing Reliability of Medical Image Diagnosis through Top-rank Learning with Rejection Module

Aug 11, 2025
Via arXiv

Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving

Jul 03, 2025
Via arXiv

Robust Probabilistic Model Checking with Continuous Reward Domains

Feb 06, 2025
Via arXiv

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Feb 03, 2025
Via arXiv

Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version)

Jul 12, 2023
Via arXiv