Zexi Li

IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning

Feb 03, 2026

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Jan 07, 2026

Step-DeepResearch Technical Report

Dec 24, 2025

FedEve: On Bridging the Client Drift and Period Drift for Cross-device Federated Learning

Aug 20, 2025

AdaFusion: Prompt-Guided Inference with Adaptive Fusion of Pathology Foundation Models

Aug 07, 2025

Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

May 26, 2025

You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data

Mar 10, 2025

Photon: Federated LLM Pre-Training

Nov 05, 2024

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering

Sep 24, 2024

Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization

May 23, 2024