Picture for Haifeng Wang

Haifeng Wang

Advantageous Parameter Expansion Training Makes Better Large Language Models

Add code
May 30, 2025
Viaarxiv icon

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

Add code
May 27, 2025
Viaarxiv icon

HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices

Add code
May 26, 2025
Viaarxiv icon

TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments

Add code
May 23, 2025
Viaarxiv icon

ToolSpectrum : Towards Personalized Tool Utilization for Large Language Models

Add code
May 19, 2025
Viaarxiv icon

Unveiling Knowledge Utilization Mechanisms in LLM-based Retrieval-Augmented Generation

Add code
May 17, 2025
Viaarxiv icon

Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking

Add code
Feb 19, 2025
Viaarxiv icon

BeamLoRA: Beam-Constraint Low-Rank Adaptation

Add code
Feb 19, 2025
Viaarxiv icon

OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds

Add code
Feb 05, 2025
Figure 1 for OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds
Figure 2 for OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds
Figure 3 for OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds
Figure 4 for OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds
Viaarxiv icon

Curiosity-Driven Reinforcement Learning from Human Feedback

Add code
Jan 20, 2025
Figure 1 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 2 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 3 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 4 for Curiosity-Driven Reinforcement Learning from Human Feedback
Viaarxiv icon