Tianyu Pang

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Jun 05, 2025

Fostering Video Reasoning via Next-Event Prediction

May 28, 2025

Reinforcing General Reasoning without Verifiers

May 27, 2025

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

May 27, 2025

Lifelong Safety Alignment for Language Models

May 26, 2025

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

May 22, 2025

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

May 21, 2025

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

May 19, 2025

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Apr 17, 2025