Yao Liu

A Distributed Framework for Privacy-Enhanced Vision Transformers on the Edge

Dec 10, 2025
Via arXiv

Exploiting Inter-Session Information with Frequency-enhanced Dual-Path Networks for Sequential Recommendation

Nov 14, 2025
Via arXiv

Ask a Strong LLM Judge when Your Reward Model is Uncertain

Oct 23, 2025
Via arXiv

Beyond Single: A Data Selection Principle for LLM Alignment via Fine-Grained Preference Signals

Aug 11, 2025
Via arXiv

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

May 22, 2025
Via arXiv

Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models

May 22, 2025
Via arXiv

Teaching Large Language Models to Reason through Learning and Forgetting

Apr 15, 2025
Via arXiv

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

Mar 15, 2025
Via arXiv

D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning

Mar 14, 2025
Via arXiv

Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving

Mar 09, 2025
Via arXiv