Picture for Zhiwei Xu

Zhiwei Xu

Reinforced Efficient Reasoning via Semantically Diverse Exploration

Add code
Jan 08, 2026
Viaarxiv icon

Adversarial Contrastive Learning for LLM Quantization Attacks

Add code
Jan 06, 2026
Viaarxiv icon

Bridging the Capability Gap: Joint Alignment Tuning for Harmonizing LLM-based Multi-Agent Systems

Add code
Sep 11, 2025
Viaarxiv icon

SLIM: Subtrajectory-Level Elimination for More Effective Reasoning

Add code
Aug 27, 2025
Figure 1 for SLIM: Subtrajectory-Level Elimination for More Effective Reasoning
Figure 2 for SLIM: Subtrajectory-Level Elimination for More Effective Reasoning
Figure 3 for SLIM: Subtrajectory-Level Elimination for More Effective Reasoning
Figure 4 for SLIM: Subtrajectory-Level Elimination for More Effective Reasoning
Viaarxiv icon

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

Add code
Jun 14, 2025
Figure 1 for Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs
Figure 2 for Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs
Figure 3 for Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs
Figure 4 for Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs
Viaarxiv icon

NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence

Add code
Apr 30, 2025
Viaarxiv icon

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

Add code
Apr 17, 2025
Viaarxiv icon

QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?

Add code
Apr 17, 2025
Figure 1 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 2 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 3 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 4 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Viaarxiv icon

Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild

Add code
Apr 17, 2025
Viaarxiv icon

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Add code
Apr 14, 2025
Viaarxiv icon