Qi Zhang

NVIDIA

No Forgetting Learning: Memory-free Continual Learning

Mar 07, 2025
Via arXiv

Better Process Supervision with Bi-directional Rewarding Signals

Mar 06, 2025
Via arXiv

MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI

Mar 04, 2025
Via arXiv

PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information

Feb 28, 2025
Via arXiv

Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?

Feb 26, 2025
Via arXiv

VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model

Feb 26, 2025
Via arXiv

Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric

Feb 25, 2025
Via arXiv

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

Feb 24, 2025
Via arXiv

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Feb 20, 2025
Via arXiv

Unsupervised CP-UNet Framework for Denoising DAS Data with Decay Noise

Feb 19, 2025
Via arXiv