Min Zhang

Jake

Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Jun 11, 2025
Via arXiv

Unlocking Recursive Thinking of LLMs: Alignment via Refinement

Jun 06, 2025
Via arXiv

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Jun 05, 2025
Via arXiv

SplitLoRA: Balancing Stability and Plasticity in Continual Learning Through Gradient Space Splitting

May 29, 2025
Via arXiv

Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing

May 28, 2025
Via arXiv

Contrastive Learning on LLM Back Generation Treebank for Cross-domain Constituency Parsing

May 27, 2025
Via arXiv

Evaluating and Steering Modality Preferences in Multimodal Large Language Model

May 27, 2025
Via arXiv

XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration

May 27, 2025
Via arXiv

REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models

May 26, 2025
Via arXiv

AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems

May 26, 2025
Via arXiv