Picture for Zhiheng Lyu

Zhiheng Lyu

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

Add code
Mar 21, 2026
Viaarxiv icon

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Add code
Mar 17, 2026
Viaarxiv icon

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Add code
Jun 16, 2025
Figure 1 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 2 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 3 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 4 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Viaarxiv icon

Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy

Add code
Jun 10, 2025
Figure 1 for Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy
Figure 2 for Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy
Figure 3 for Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy
Figure 4 for Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy
Viaarxiv icon

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Add code
May 26, 2025
Figure 1 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 2 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 3 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 4 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Viaarxiv icon

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Add code
Apr 03, 2025
Viaarxiv icon

PixelWorld: Towards Perceiving Everything as Pixels

Add code
Jan 31, 2025
Viaarxiv icon

FACTTRACK: Time-Aware World State Tracking in Story Outlines

Add code
Jul 23, 2024
Viaarxiv icon

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Add code
Jun 24, 2024
Figure 1 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 2 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 3 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 4 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Viaarxiv icon

On the Causal Nature of Sentiment Analysis

Add code
Apr 17, 2024
Figure 1 for On the Causal Nature of Sentiment Analysis
Figure 2 for On the Causal Nature of Sentiment Analysis
Figure 3 for On the Causal Nature of Sentiment Analysis
Figure 4 for On the Causal Nature of Sentiment Analysis
Viaarxiv icon