Picture for Hui Liu

Hui Liu

How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?

Add code
Feb 25, 2026
Viaarxiv icon

How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

Add code
Feb 17, 2026
Viaarxiv icon

Fix Before Search: Benchmarking Agentic Query Visual Pre-processing in Multimodal Retrieval-augmented Generation

Add code
Feb 13, 2026
Viaarxiv icon

Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents

Add code
Feb 11, 2026
Viaarxiv icon

Attn-GS: Attention-Guided Context Compression for Efficient Personalized LLMs

Add code
Feb 08, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

AERO: Autonomous Evolutionary Reasoning Optimization via Endogenous Dual-Loop Feedback

Add code
Feb 03, 2026
Viaarxiv icon

Exposing Vulnerabilities in Explanation for Time Series Classifiers via Dual-Target Attacks

Add code
Feb 02, 2026
Viaarxiv icon

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use

Add code
Jan 31, 2026
Viaarxiv icon

Position: Agentic Evolution is the Path to Evolving LLMs

Add code
Jan 30, 2026
Viaarxiv icon