Picture for Yuling Shi

Yuling Shi

Code Is More Than Text: Uncertainty Estimation for Code Generation

Add code
Jun 08, 2026
Viaarxiv icon

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Add code
Jun 05, 2026
Viaarxiv icon

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

Add code
May 28, 2026
Viaarxiv icon

Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing

Add code
May 06, 2026
Viaarxiv icon

ShredBench: Evaluating the Semantic Reasoning Capabilities of Multimodal LLMs in Document Reconstruction

Add code
Apr 26, 2026
Viaarxiv icon

EffiSkill: Agent Skill Based Automated Code Efficiency Optimization

Add code
Mar 29, 2026
Viaarxiv icon

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents

Add code
Feb 08, 2026
Viaarxiv icon

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Add code
Feb 03, 2026
Viaarxiv icon

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Add code
Feb 02, 2026
Viaarxiv icon

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

Add code
Jan 23, 2026
Viaarxiv icon