Picture for Dongrui Liu

Dongrui Liu

Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

Add code
May 20, 2026
Viaarxiv icon

Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

Add code
May 20, 2026
Viaarxiv icon

Focused Forcing: Content-Aware Per-Frame KV Selection for Efficient Autoregressive Video Diffusion

Add code
May 18, 2026
Viaarxiv icon

TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Add code
May 10, 2026
Viaarxiv icon

On the Blessing of Pre-training in Weak-to-Strong Generalization

Add code
May 07, 2026
Viaarxiv icon

Multilingual Safety Alignment via Self-Distillation

Add code
May 03, 2026
Viaarxiv icon

Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX

Add code
Apr 16, 2026
Viaarxiv icon

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Add code
Apr 08, 2026
Viaarxiv icon

ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

Add code
Apr 08, 2026
Viaarxiv icon

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Add code
Apr 02, 2026
Viaarxiv icon