Kaiyuan Zhang

Equipping Retrieval-Augmented Large Language Models with Document Structure Awareness

Oct 05, 2025

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Sep 16, 2025

Chinese Court Simulation with LLM-Based Agent System

Aug 24, 2025

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

Jun 12, 2025

IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents

Jun 09, 2025

LLM Agents Should Employ Security Principles

May 29, 2025

MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation

May 27, 2025

CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR

May 24, 2025

Seed1.5-VL Technical Report

May 11, 2025

$μ$KE: Matryoshka Unstructured Knowledge Editing of Large Language Models

Apr 01, 2025