Picture for Jing Shao

Jing Shao

Toward Efficient Agents: Memory, Tool learning, and Planning

Add code
Jan 20, 2026
Viaarxiv icon

Loop as a Bridge: Can Looped Transformers Truly Link Representation Space and Natural Language Outputs?

Add code
Jan 15, 2026
Viaarxiv icon

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Add code
Jan 15, 2026
Viaarxiv icon

ProGuard: Towards Proactive Multimodal Safeguard

Add code
Dec 29, 2025
Viaarxiv icon

Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard

Add code
Nov 14, 2025
Figure 1 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 2 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 3 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 4 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Viaarxiv icon

When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms

Add code
Nov 09, 2025
Viaarxiv icon

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Figure 1 for ExGRPO: Learning to Reason from Experience
Figure 2 for ExGRPO: Learning to Reason from Experience
Figure 3 for ExGRPO: Learning to Reason from Experience
Figure 4 for ExGRPO: Learning to Reason from Experience
Viaarxiv icon

STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models

Add code
Sep 30, 2025
Figure 1 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 2 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 3 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 4 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Viaarxiv icon

The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations

Add code
Sep 16, 2025
Viaarxiv icon

Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

Add code
Sep 04, 2025
Figure 1 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 2 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 3 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 4 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Viaarxiv icon