Picture for Jun Sun

Jun Sun

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Add code
Apr 22, 2025
Viaarxiv icon

Propaganda via AI? A Study on Semantic Backdoors in Large Language Models

Add code
Apr 15, 2025
Viaarxiv icon

AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents

Add code
Mar 24, 2025
Viaarxiv icon

PALo: Learning Posture-Aware Locomotion for Quadruped Robots

Add code
Mar 06, 2025
Viaarxiv icon

Verification of Bit-Flip Attacks against Quantized Neural Networks

Add code
Feb 22, 2025
Viaarxiv icon

Universal Semantic Embeddings of Chemical Elements for Enhanced Materials Inference and Discovery

Add code
Feb 19, 2025
Viaarxiv icon

Democratic Training Against Universal Adversarial Perturbations

Add code
Feb 08, 2025
Viaarxiv icon

Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

Add code
Dec 17, 2024
Figure 1 for Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Figure 2 for Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Figure 3 for Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Figure 4 for Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Viaarxiv icon

The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap

Add code
Dec 09, 2024
Figure 1 for The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap
Figure 2 for The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap
Figure 3 for The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap
Viaarxiv icon

CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization

Add code
Nov 18, 2024
Figure 1 for CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
Figure 2 for CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
Figure 3 for CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
Figure 4 for CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
Viaarxiv icon