Zhiqiang Lin

PAuth - Precise Task-Scoped Authorization For Agents

Mar 17, 2026
Via arXiv

A Survey of Reasoning in Autonomous Driving Systems: Open Challenges and Emerging Paradigms

Mar 11, 2026
Via arXiv

Atomicity for Agents: Exposing, Exploiting, and Mitigating TOCTOU Vulnerabilities in Browser-Use Agents

Feb 28, 2026
Via arXiv

Web Verbs: Typed Abstractions for Reliable Task Composition on the Agentic Web

Feb 19, 2026
Via arXiv

SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature

Jan 15, 2026
Via arXiv

RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

May 28, 2025
Via arXiv

C2RUST-BENCH: A Minimized, Representative Dataset for C-to-Rust Transpilation Evaluation

Apr 21, 2025
Via arXiv

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

Dec 15, 2023
Via arXiv