Wanxiang Che

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model

Apr 22, 2026
Via arXiv

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

Apr 07, 2026
Via arXiv

EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction

Mar 24, 2026
Via arXiv

ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling

Mar 10, 2026
Via arXiv

ArcLight: A Lightweight LLM Inference Architecture for Many-Core CPUs

Mar 08, 2026
Via arXiv

Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

Feb 13, 2026
Via arXiv

How Do Language Models Understand Tables? A Mechanistic Analysis of Cell Location

Feb 09, 2026
Via arXiv

When Does Context Help? Error Dynamics of Contextual Information in Large Language Models

Feb 09, 2026
Via arXiv

CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability

Feb 03, 2026
Via arXiv

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering

Jan 30, 2026
Via arXiv