Picture for Yun Yue

Yun Yue

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Add code
Nov 18, 2025
Figure 1 for Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Figure 2 for Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Figure 3 for Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Figure 4 for Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Viaarxiv icon

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Add code
Aug 20, 2025
Viaarxiv icon

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Add code
Aug 11, 2025
Figure 1 for Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment
Figure 2 for Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment
Figure 3 for Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment
Figure 4 for Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment
Viaarxiv icon

EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models

Add code
Dec 10, 2024
Figure 1 for EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
Figure 2 for EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
Figure 3 for EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
Figure 4 for EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
Viaarxiv icon

Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts

Add code
Sep 25, 2024
Figure 1 for Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts
Figure 2 for Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts
Figure 3 for Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts
Figure 4 for Wildlife Product Trading in Online Social Networks: A Case Study on Ivory-Related Product Sales Promotion Posts
Viaarxiv icon

Understanding Hyperbolic Metric Learning through Hard Negative Sampling

Add code
Apr 23, 2024
Viaarxiv icon

AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix

Add code
Dec 04, 2023
Figure 1 for AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix
Figure 2 for AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix
Figure 3 for AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix
Figure 4 for AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix
Viaarxiv icon

Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term

Add code
May 25, 2023
Figure 1 for Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Figure 2 for Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Figure 3 for Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Figure 4 for Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Viaarxiv icon

Hyperbolic Contrastive Learning

Add code
Feb 02, 2023
Figure 1 for Hyperbolic Contrastive Learning
Figure 2 for Hyperbolic Contrastive Learning
Figure 3 for Hyperbolic Contrastive Learning
Figure 4 for Hyperbolic Contrastive Learning
Viaarxiv icon

Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction

Add code
Jul 30, 2021
Figure 1 for Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction
Figure 2 for Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction
Figure 3 for Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction
Figure 4 for Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction
Viaarxiv icon