Picture for Chuxue Cao

Chuxue Cao

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Add code
Jan 30, 2026
Viaarxiv icon

LRAS: Advanced Legal Reasoning with Agentic Search

Add code
Jan 12, 2026
Viaarxiv icon

MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data

Add code
Dec 15, 2025
Viaarxiv icon

SafeLawBench: Towards Safe Alignment of Large Language Models

Add code
Jun 07, 2025
Figure 1 for SafeLawBench: Towards Safe Alignment of Large Language Models
Figure 2 for SafeLawBench: Towards Safe Alignment of Large Language Models
Figure 3 for SafeLawBench: Towards Safe Alignment of Large Language Models
Figure 4 for SafeLawBench: Towards Safe Alignment of Large Language Models
Viaarxiv icon

Measuring Hong Kong Massive Multi-Task Language Understanding

Add code
May 04, 2025
Viaarxiv icon