Picture for Qirui Zheng

Qirui Zheng

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

Add code
Jun 01, 2026
Viaarxiv icon

Beyond Autoregressive RTG: Conditioning via Injection Outside Sequential Modeling in Decision Transformer

Add code
May 07, 2026
Viaarxiv icon

Decoupling Return-to-Go for Efficient Decision Transformer

Add code
Jan 22, 2026
Viaarxiv icon

InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

Add code
May 29, 2025
Viaarxiv icon

Single-Pass Document Scanning for Question Answering

Add code
Apr 04, 2025
Figure 1 for Single-Pass Document Scanning for Question Answering
Figure 2 for Single-Pass Document Scanning for Question Answering
Figure 3 for Single-Pass Document Scanning for Question Answering
Figure 4 for Single-Pass Document Scanning for Question Answering
Viaarxiv icon