Tianlin Li

MASteer: Multi-Agent Adaptive Steer Strategy for End-to-End LLM Trustworthiness Repair

Aug 09, 2025
Via arXiv

Investigating Training Data Detection in AI Coders

Jul 23, 2025
Via arXiv

Fair-PP: A Synthetic Dataset for Aligning LLM with Personalized Preferences of Social Equity

May 17, 2025
Via arXiv

TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis

May 11, 2025
Via arXiv

Software Development Life Cycle Perspective: A Survey of Benchmarks for CodeLLMs and Agents

May 08, 2025
Via arXiv

A Vision for Auto Research with LLM Agents

Apr 26, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

Defending LVLMs Against Vision Attacks through Partial-Perception Supervision

Dec 17, 2024
Via arXiv

Benchmarking Bias in Large Language Models during Role-Playing

Nov 01, 2024
Via arXiv

Speculative Coreset Selection for Task-Specific Fine-tuning

Oct 02, 2024
Via arXiv