Xiaozhong Liu

Can We Trust a Black-box LLM? LLM Untrustworthy Boundary Detection via Bias-Diffusion and Multi-Agent Reinforcement Learning

Apr 07, 2026
Via arXiv

PubMed Reasoner: Dynamic Reasoning-based Retrieval for Evidence-Grounded Biomedical Question Answering

Mar 28, 2026
Via arXiv

P2S: Probabilistic Process Supervision for General-Domain Reasoning Question Answering

Jan 28, 2026
Via arXiv

VitalDiagnosis: AI-Driven Ecosystem for 24/7 Vital Monitoring and Chronic Disease Management

Jan 22, 2026
Via arXiv

Human-in-the-Loop Interactive Report Generation for Chronic Disease Adherence

Jan 10, 2026
Via arXiv

Teaching According to Students' Aptitude: Personalized Mathematics Tutoring via Persona-, Memory-, and Forgetting-Aware LLMs

Nov 19, 2025
Via arXiv

Active Domain Knowledge Acquisition with $100 Budget: Enhancing LLMs via Cost-Efficient, Expert-Involved Interaction in Sensitive Domains

Aug 24, 2025
Via arXiv

LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

May 26, 2025
Via arXiv

CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs

May 24, 2025
Via arXiv

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

May 22, 2025
Via arXiv