Picture for Wenpeng Yin

Wenpeng Yin

The Tool Illusion: Rethinking Tool Use in Web Agents

Add code
Apr 03, 2026
Viaarxiv icon

M2-Verify: A Large-Scale Multidomain Benchmark for Checking Multimodal Claim Consistency

Add code
Apr 01, 2026
Viaarxiv icon

Accurate and Scalable Matrix Mechanisms via Divide and Conquer

Add code
Apr 01, 2026
Viaarxiv icon

Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning

Add code
Mar 23, 2026
Viaarxiv icon

Functionality-Oriented LLM Merging on the Fisher--Rao Manifold

Add code
Mar 05, 2026
Viaarxiv icon

Understanding Dynamic Compute Allocation in Recurrent Transformers

Add code
Feb 09, 2026
Viaarxiv icon

DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning

Add code
Jan 12, 2026
Viaarxiv icon

ScaleFormer: Span Representation Cumulation for Long-Context Transformer

Add code
Nov 13, 2025
Figure 1 for ScaleFormer: Span Representation Cumulation for Long-Context Transformer
Figure 2 for ScaleFormer: Span Representation Cumulation for Long-Context Transformer
Figure 3 for ScaleFormer: Span Representation Cumulation for Long-Context Transformer
Figure 4 for ScaleFormer: Span Representation Cumulation for Long-Context Transformer
Viaarxiv icon

SIM: A mapping framework for built environment auditing based on street view imagery

Add code
May 29, 2025
Viaarxiv icon

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Add code
Apr 29, 2025
Figure 1 for HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Figure 2 for HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Figure 3 for HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Figure 4 for HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Viaarxiv icon