Xianjun Yang

Learning Personalized Agents from Human Feedback

Feb 18, 2026

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

Dr. Zero: Self-Evolving Search Agents without Training Data

Jan 11, 2026

Hallucination as a Computational Boundary: A Hierarchy of Inevitability and the Oracle Escape

Aug 10, 2025

Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder

Feb 19, 2025

MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison

Feb 07, 2025

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Nov 21, 2024

CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy

Oct 17, 2024

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Jul 06, 2024

MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate

Jun 26, 2024