Zifeng Wang

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Oct 29, 2025
Via arXiv

Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation

May 28, 2025
Via arXiv

BioDSA-1K: Benchmarking Data Science Agents for Biomedical Research

May 22, 2025
Via arXiv

TrialPanorama: Database and Benchmark for Systematic Review and Design of Clinical Trials

May 22, 2025
Via arXiv

s3: You Don't Need That Much Data to Train a Search Agent via RL

May 20, 2025
Via arXiv

InformGen: An AI Copilot for Accurate and Compliant Clinical Research Consent Document Generation

Apr 01, 2025
Via arXiv

In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents

Mar 11, 2025
Via arXiv

Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation

Mar 10, 2025
Via arXiv

STAR: Stability-Inducing Weight Perturbation for Continual Learning

Mar 03, 2025
Via arXiv

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Feb 22, 2025
Via arXiv