Picture for Jiaheng Zhang

Jiaheng Zhang

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

Hijacking Large Audio-Language Models via Context-Agnostic and Imperceptible Auditory Prompt Injection

Add code
Apr 16, 2026
Viaarxiv icon

STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling

Add code
Mar 18, 2026
Viaarxiv icon

Gym-V: A Unified Vision Environment System for Agentic Vision Research

Add code
Mar 17, 2026
Viaarxiv icon

Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration

Add code
Mar 03, 2026
Viaarxiv icon

IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

Add code
Feb 26, 2026
Viaarxiv icon

KLong: Training LLM Agent for Extremely Long-horizon Tasks

Add code
Feb 19, 2026
Viaarxiv icon

On Representation Redundancy in Large-Scale Instruction Tuning Data Selection

Add code
Feb 14, 2026
Viaarxiv icon

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Add code
Feb 07, 2026
Viaarxiv icon

MemPot: Defending Against Memory Extraction Attack with Optimized Honeypots

Add code
Feb 07, 2026
Viaarxiv icon