Taolin Zhang

An Information-Theoretic Framework for Robust Large Language Model Editing

Dec 18, 2025
Via arXiv

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Nov 18, 2025
Via arXiv

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Nov 11, 2025
Via arXiv

Rethinking Verification for LLM Code Generation: From Generation to Testing

Jul 09, 2025
Via arXiv

Coding Triangle: How Does Large Language Model Understand Code?

Jul 08, 2025
Via arXiv

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

May 26, 2025
Via arXiv

UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models

May 18, 2025
Via arXiv

BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering

May 17, 2025
Via arXiv

Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability

May 02, 2025
Via arXiv

A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions

Apr 12, 2025
Via arXiv