Picture for Tianyi Wang

Tianyi Wang

Zhejiang University, Hangzhou, China

Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards

Add code
Apr 11, 2026
Viaarxiv icon

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Add code
Apr 10, 2026
Viaarxiv icon

DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay

Add code
Mar 17, 2026
Viaarxiv icon

When to Lock Attention: Training-Free KV Control in Video Diffusion

Add code
Mar 10, 2026
Viaarxiv icon

LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model

Add code
Mar 03, 2026
Viaarxiv icon

Physics-informed Active Polarimetric 3D Imaging for Specular Surfaces

Add code
Feb 23, 2026
Viaarxiv icon

PT-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Papers

Add code
Feb 14, 2026
Viaarxiv icon

Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

Add code
Feb 11, 2026
Viaarxiv icon

D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning

Add code
Feb 08, 2026
Viaarxiv icon

Rethinking Latency Denial-of-Service: Attacking the LLM Serving Framework, Not the Model

Add code
Feb 08, 2026
Viaarxiv icon