Ruizhong Qiu

TRIMS: Trajectory-Ranked Instruction Masked Supervision for Diffusion Language Models

Apr 01, 2026
Via arXiv

Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR

Mar 25, 2026
Via arXiv

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Mar 10, 2026
Via arXiv

Graph homophily booster: Reimagining the role of discrete features in heterophilic graph learning

Feb 06, 2026
Via arXiv

TSAQA: Time Series Analysis Question And Answering Benchmark

Jan 30, 2026
Via arXiv

Agentic Reasoning for Large Language Models

Jan 18, 2026
Via arXiv

Subspace Alignment for Vision-Language Model Test-time Adaptation

Jan 13, 2026
Via arXiv

AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs

Jan 09, 2026
Via arXiv

Don't Waste It: Guiding Generative Recommenders with Structured Human Priors via Multi-head Decoding

Nov 16, 2025
Via arXiv

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

Oct 01, 2025
Via arXiv