Picture for Zixuan Wang

Zixuan Wang

Michael Pokorny

Step-DeepResearch Technical Report

Add code
Dec 24, 2025
Viaarxiv icon

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

Scaling Latent Reasoning via Looped Language Models

Add code
Oct 29, 2025
Figure 1 for Scaling Latent Reasoning via Looped Language Models
Figure 2 for Scaling Latent Reasoning via Looped Language Models
Figure 3 for Scaling Latent Reasoning via Looped Language Models
Figure 4 for Scaling Latent Reasoning via Looped Language Models
Viaarxiv icon

On Continuous Optimization for Constraint Satisfaction Problems

Add code
Oct 06, 2025
Figure 1 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 2 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 3 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 4 for On Continuous Optimization for Constraint Satisfaction Problems
Viaarxiv icon

ANNIE: Be Careful of Your Robots

Add code
Sep 03, 2025
Viaarxiv icon

PP-Motion: Physical-Perceptual Fidelity Evaluation for Human Motion Generation

Add code
Aug 11, 2025
Viaarxiv icon

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Add code
Aug 07, 2025
Viaarxiv icon

FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance

Add code
Jul 23, 2025
Figure 1 for FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance
Figure 2 for FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance
Figure 3 for FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance
Figure 4 for FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance
Viaarxiv icon

Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation

Add code
Jul 23, 2025
Viaarxiv icon

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Add code
Jun 13, 2025
Viaarxiv icon