Picture for Conghui He

Conghui He

IPCV: Information-Preserving Compression for MLLM Visual Encoders

Add code
Dec 21, 2025
Viaarxiv icon

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Add code
Dec 18, 2025
Viaarxiv icon

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Add code
Dec 16, 2025
Viaarxiv icon

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Add code
Dec 11, 2025
Viaarxiv icon

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Add code
Dec 11, 2025
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

Add code
Nov 14, 2025
Viaarxiv icon

OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild

Add code
Nov 11, 2025
Viaarxiv icon

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Add code
Oct 30, 2025
Viaarxiv icon

Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs

Add code
Oct 27, 2025
Viaarxiv icon