Xuebo Liu

REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models

May 26, 2025

Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments

May 23, 2025

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

May 22, 2025

AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration

Mar 24, 2025

Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis

Feb 18, 2025

DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

Dec 19, 2024

Knowledge Editing with Dynamic Knowledge Graphs for Multi-hop Question Answering

Dec 18, 2024

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization

Nov 21, 2024

NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

Oct 28, 2024

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Oct 10, 2024