Dongmei Zhang

ORACLE-SWE: Quantifying the Contribution of Oracle Information Signals on SWE Agents

Apr 09, 2026
Via arXiv

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

Apr 07, 2026
Via arXiv

Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs

Mar 14, 2026
Via arXiv

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Mar 05, 2026
Via arXiv

Test-Time Learning of Causal Structure from Interventional Data

Feb 22, 2026
Via arXiv

A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research

Feb 14, 2026
Via arXiv

GUI-360$^\circ$: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Nov 10, 2025
Via arXiv

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Nov 06, 2025
Via arXiv

SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets

Oct 22, 2025
Via arXiv

Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search

Sep 11, 2025
Via arXiv