Lei Hou

WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection

Oct 21, 2025 (via arXiv)

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Oct 02, 2025 (via arXiv)

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Jun 11, 2025 (via arXiv)

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Jun 04, 2025 (via arXiv)

How does Transformer Learn Implicit Reasoning?

May 29, 2025 (via arXiv)

Are Reasoning Models More Prone to Hallucination?

May 29, 2025 (via arXiv)

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models

May 26, 2025 (via arXiv)

AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

May 22, 2025 (via arXiv)

AdaptThink: Reasoning Models Can Learn When to Think

May 19, 2025 (via arXiv)

LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning

May 04, 2025 (via arXiv)