Picture for Xinyu Dai

Xinyu Dai

PEMark: Watermarking API Responses Based on Proxy Gateways and Position Encoding

Add code
May 21, 2026
Viaarxiv icon

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

Add code
May 19, 2026
Viaarxiv icon

Optimizing RAG Rerankers with LLM Feedback via Reinforcement Learning

Add code
Apr 02, 2026
Viaarxiv icon

WebNavigator: Global Web Navigation via Interaction Graph Retrieval

Add code
Mar 20, 2026
Viaarxiv icon

Dynamic Decision-Making under Model Misspecification: A Stochastic Stability Approach

Add code
Feb 19, 2026
Viaarxiv icon

Persona-Aware Alignment Framework for Personalized Dialogue Generation

Add code
Nov 13, 2025
Viaarxiv icon

Counterfactual Language Reasoning for Explainable Recommendation Systems

Add code
Mar 11, 2025
Figure 1 for Counterfactual Language Reasoning for Explainable Recommendation Systems
Figure 2 for Counterfactual Language Reasoning for Explainable Recommendation Systems
Figure 3 for Counterfactual Language Reasoning for Explainable Recommendation Systems
Figure 4 for Counterfactual Language Reasoning for Explainable Recommendation Systems
Viaarxiv icon

PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues

Add code
Feb 28, 2025
Viaarxiv icon

Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark

Add code
Jan 02, 2025
Figure 1 for Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark
Figure 2 for Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark
Figure 3 for Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark
Figure 4 for Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark
Viaarxiv icon

GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models

Add code
Dec 30, 2024
Figure 1 for GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
Figure 2 for GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
Figure 3 for GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
Figure 4 for GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
Viaarxiv icon