Zhenyu Hou

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Jul 02, 2025

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Jun 13, 2025

SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling

Jun 09, 2025

Controlling Large Language Model with Latent Actions

Mar 27, 2025

Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process

Mar 06, 2025

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Jan 20, 2025

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Dec 08, 2024

SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Oct 29, 2024

LongReward: Improving Long-context Large Language Models with AI Feedback

Oct 28, 2024

Generalizing Graph Transformers Across Diverse Graphs and Tasks via Pre-Training on Industrial-Scale Data

Jul 04, 2024