Qi He

ToolRL: Reward is All Tool Learning Needs

Apr 16, 2025
Via arXiv

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Apr 13, 2025
Via arXiv

Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

Apr 11, 2025
Via arXiv

Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs

Mar 27, 2025
Via arXiv

HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard

Mar 18, 2025
Via arXiv

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Mar 05, 2025
Via arXiv

How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities

Feb 26, 2025
Via arXiv

A General Framework to Enhance Fine-tuning-based LLM Unlearning

Feb 25, 2025
Via arXiv

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design

Feb 18, 2025
Via arXiv

Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models

Feb 18, 2025
Via arXiv