Picture for Hang Yan

Hang Yan

Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning

Add code
Sep 16, 2025
Figure 1 for Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning
Figure 2 for Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning
Figure 3 for Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning
Figure 4 for Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning
Viaarxiv icon

Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories

Add code
Sep 16, 2025
Figure 1 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 2 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 3 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 4 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Viaarxiv icon

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark

Add code
Aug 26, 2025
Viaarxiv icon

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law

Add code
Jun 16, 2025
Figure 1 for Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law
Figure 2 for Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law
Figure 3 for Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law
Figure 4 for Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law
Viaarxiv icon

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

Add code
Jun 14, 2025
Figure 1 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 2 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 3 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 4 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Viaarxiv icon

RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization

Add code
May 16, 2025
Viaarxiv icon

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Add code
Apr 11, 2025
Figure 1 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 2 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 3 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 4 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Viaarxiv icon

$φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Add code
Mar 17, 2025
Viaarxiv icon

GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?

Add code
Dec 13, 2024
Viaarxiv icon

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices

Add code
Sep 03, 2024
Figure 1 for What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
Figure 2 for What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
Figure 3 for What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
Figure 4 for What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
Viaarxiv icon