Picture for Zirui Liu

Zirui Liu

Automating Expert-Level Medical Reasoning Evaluation of Large Language Models

Add code
Jul 10, 2025
Viaarxiv icon

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts?

Add code
Jun 05, 2025
Viaarxiv icon

SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA

Add code
May 29, 2025
Viaarxiv icon

Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model

Add code
May 28, 2025
Viaarxiv icon

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems

Add code
May 23, 2025
Viaarxiv icon

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning

Add code
May 22, 2025
Viaarxiv icon

am-ELO: A Stable Framework for Arena-based LLM Evaluation

Add code
May 06, 2025
Viaarxiv icon

Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models

Add code
Apr 05, 2025
Viaarxiv icon

EPEE: Towards Efficient and Effective Foundation Models in Biomedicine

Add code
Mar 03, 2025
Viaarxiv icon