Picture for Yuxia Wang

Yuxia Wang

The Geometry of Forgetting: Temporal Knowledge Drift as an Independent Axis in LLM Representations

Add code
May 09, 2026
Viaarxiv icon

SAHM: A Benchmark for Arabic Financial and Shari'ah-Compliant Reasoning

Add code
Apr 21, 2026
Viaarxiv icon

Harm or Humor: A Multimodal, Multilingual Benchmark for Overt and Covert Harmful Humor

Add code
Mar 19, 2026
Viaarxiv icon

The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems

Add code
Feb 11, 2026
Viaarxiv icon

Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection

Add code
Feb 09, 2026
Viaarxiv icon

RealFin: How Well Do LLMs Reason About Finance When Users Leave Things Unsaid?

Add code
Feb 06, 2026
Viaarxiv icon

AICD Bench: A Challenging Benchmark for AI-Generated Code Detection

Add code
Feb 02, 2026
Viaarxiv icon

How Does Prefix Matter in Reasoning Model Tuning?

Add code
Jan 04, 2026
Viaarxiv icon

Explicit and Implicit Data Augmentation for Social Event Detection

Add code
Sep 04, 2025
Viaarxiv icon

FRaN-X: FRaming and Narratives-eXplorer

Add code
Jul 09, 2025
Viaarxiv icon