Picture for Zehai He

Zehai He

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification

Add code
Mar 27, 2026
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Figure 1 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 2 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 3 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 4 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Viaarxiv icon

LVBench: An Extreme Long Video Understanding Benchmark

Add code
Jun 12, 2024
Figure 1 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 2 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 3 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 4 for LVBench: An Extreme Long Video Understanding Benchmark
Viaarxiv icon

GPT Can Solve Mathematical Problems Without a Calculator

Add code
Sep 12, 2023
Figure 1 for GPT Can Solve Mathematical Problems Without a Calculator
Figure 2 for GPT Can Solve Mathematical Problems Without a Calculator
Figure 3 for GPT Can Solve Mathematical Problems Without a Calculator
Figure 4 for GPT Can Solve Mathematical Problems Without a Calculator
Viaarxiv icon