Feifei Zhao

VESTA: A Fully Automated Scenario Generation and Safety Evaluation Framework for LLM Agents

Jun 07, 2026
Via arXiv

CogManip: Benchmarking Manipulative Behavior in Multi-Turn Interactions with Large Language Model

Jun 04, 2026
Via arXiv

Drug Synergy Prediction via Residual Graph Isomorphism Networks and Attention Mechanisms

Apr 23, 2026
Via arXiv

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Feb 15, 2026
Via arXiv

Light Alignment Improves LLM Safety via Model Self-Reflection with a Single Neuron

Feb 02, 2026
Via arXiv

TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers

Jan 26, 2026
Via arXiv

CogToM: A Comprehensive Theory of Mind Benchmark inspired by Human Cognition for Large Language Models

Jan 22, 2026
Via arXiv

MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values

Sep 09, 2025
Via arXiv

Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society

Apr 24, 2025
Via arXiv

Continual Learning of Multiple Cognitive Functions with Brain-inspired Temporal Development Mechanism

Apr 08, 2025
Via arXiv