Picture for Xiao Yang

Xiao Yang

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic

Add code
Dec 24, 2025
Viaarxiv icon

EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks

Add code
Nov 16, 2025
Viaarxiv icon

KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs

Add code
Nov 09, 2025
Viaarxiv icon

Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin

Add code
Nov 08, 2025
Viaarxiv icon

CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark

Add code
Oct 30, 2025
Figure 1 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 2 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 3 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 4 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Viaarxiv icon

Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents

Add code
Oct 09, 2025
Figure 1 for Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
Figure 2 for Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
Figure 3 for Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
Figure 4 for Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
Viaarxiv icon

KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering

Add code
Sep 05, 2025
Viaarxiv icon

Deep Reinforcement Learning for Ranking Utility Tuning in the Ad Recommender System at Pinterest

Add code
Sep 05, 2025
Viaarxiv icon

Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement

Add code
Jul 10, 2025
Viaarxiv icon

Next-User Retrieval: Enhancing Cold-Start Recommendations via Generative Next-User Modeling

Add code
Jun 18, 2025
Viaarxiv icon