Tianyu Pang

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Jun 05, 2025
Via arXiv

Fostering Video Reasoning via Next-Event Prediction

May 28, 2025
Via arXiv

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

May 27, 2025
Via arXiv

Reinforcing General Reasoning without Verifiers

May 27, 2025
Via arXiv

Lifelong Safety Alignment for Language Models

May 26, 2025
Via arXiv

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

May 22, 2025
Via arXiv

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

May 21, 2025
Via arXiv

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

May 19, 2025
Via arXiv

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025
Via arXiv

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Apr 17, 2025
Via arXiv