Picture for Hongru Wang

Hongru Wang

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Viaarxiv icon

ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs

Add code
Apr 17, 2025
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Viaarxiv icon

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Add code
Mar 31, 2025
Viaarxiv icon

DAST: Difficulty-Aware Self-Training on Large Language Models

Add code
Mar 12, 2025
Viaarxiv icon

SMART: Self-Aware Agent for Tool Overuse Mitigation

Add code
Feb 17, 2025
Figure 1 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 2 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 3 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 4 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Viaarxiv icon

NILE: Internal Consistency Alignment in Large Language Models

Add code
Dec 21, 2024
Viaarxiv icon

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Add code
Dec 16, 2024
Viaarxiv icon

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

Add code
Nov 15, 2024
Figure 1 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 2 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 3 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 4 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Viaarxiv icon

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Add code
Oct 21, 2024
Figure 1 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 2 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 3 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 4 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Viaarxiv icon