Picture for Xiaomo Liu

Xiaomo Liu

Detecting Non-Membership in LLM Training Data via Rank Correlations

Add code
Mar 24, 2026
Viaarxiv icon

Distill and Align Decomposition for Enhanced Claim Verification

Add code
Feb 25, 2026
Viaarxiv icon

ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images

Add code
Feb 12, 2026
Viaarxiv icon

Perturb Your Data: Paraphrase-Guided Training Data Watermarking

Add code
Dec 18, 2025
Figure 1 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 2 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 3 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 4 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Viaarxiv icon

CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation

Add code
Aug 07, 2025
Viaarxiv icon

Entropy-Aware Branching for Improved Mathematical Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

"What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs

Add code
Oct 20, 2024
Figure 1 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 2 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 3 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 4 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Viaarxiv icon

Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation

Add code
Oct 03, 2024
Figure 1 for Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation
Figure 2 for Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation
Figure 3 for Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation
Figure 4 for Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation
Viaarxiv icon

CodeMirage: Hallucinations in Code Generated by Large Language Models

Add code
Aug 14, 2024
Figure 1 for CodeMirage: Hallucinations in Code Generated by Large Language Models
Figure 2 for CodeMirage: Hallucinations in Code Generated by Large Language Models
Figure 3 for CodeMirage: Hallucinations in Code Generated by Large Language Models
Figure 4 for CodeMirage: Hallucinations in Code Generated by Large Language Models
Viaarxiv icon

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

Add code
Apr 05, 2024
Viaarxiv icon