Chenguang Wang

Michael Pokorny

MIRAGE: A Polarity-Flipping Encoding Subspace in LLM Agents

Jun 09, 2026
Via arXiv

Ishigaki-IDS: An Open-Weight Verifier-Aware Model for Information Delivery Specification Drafting in Building Information Modeling

Jun 07, 2026
Via arXiv

Agents' Last Exam

Jun 03, 2026
Via arXiv

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

Jun 03, 2026
Via arXiv

ProActor: Timing-Aware Reinforcement Learning for Proactive Task Scheduling Agents

May 24, 2026
Via arXiv

Ishigaki-IDS-Bench: A Benchmark for Generating Information Delivery Specification from BIM Information Requirements

May 21, 2026
Via arXiv

A Framework for Formalizing LLM Agent Security

Mar 19, 2026
Via arXiv

Order Matters in Retrosynthesis: Structure-aware Generation via Reaction-Center-Guided Discrete Flow Matching

Feb 13, 2026
Via arXiv

ALPBench: A Benchmark for Attribution-level Long-term Personal Behavior Understanding

Feb 03, 2026
Via arXiv

Persona-aware and Explainable Bikeability Assessment: A Vision-Language Model Approach

Jan 07, 2026
Via arXiv