Qingwei Lin

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents

Apr 23, 2026

Contrastive Attribution in the Wild: An Interpretability Analysis of LLM Failures on Realistic Benchmarks

Apr 20, 2026

DUET: Joint Exploration of User Item Profiles in Recommendation System

Apr 15, 2026

Beyond State Consistency: Behavior Consistency in Text-Based World Models

Apr 15, 2026

WebXSkill: Skill Learning for Autonomous Web Agents

Apr 14, 2026

ORACLE-SWE: Quantifying the Contribution of Oracle Information Signals on SWE Agents

Apr 09, 2026

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

Apr 07, 2026

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Mar 05, 2026

A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research

Feb 14, 2026

GUI-360°: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Nov 10, 2025