Picture for Jiayu Wang

Jiayu Wang

LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild

Add code
Oct 16, 2025
Viaarxiv icon

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Add code
Jun 06, 2025
Viaarxiv icon

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Add code
Jun 05, 2025
Viaarxiv icon

OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

Add code
May 24, 2025
Figure 1 for OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Figure 2 for OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Figure 3 for OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Figure 4 for OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Viaarxiv icon

COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Add code
Apr 30, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Figure 1 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 2 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 3 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 4 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Viaarxiv icon

Wan: Open and Advanced Large-Scale Video Generative Models

Add code
Mar 26, 2025
Viaarxiv icon

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Add code
Nov 13, 2024
Figure 1 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 2 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 3 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 4 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Viaarxiv icon

InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems

Add code
Oct 21, 2024
Figure 1 for InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems
Figure 2 for InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems
Figure 3 for InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems
Figure 4 for InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems
Viaarxiv icon

Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia

Add code
Sep 25, 2024
Viaarxiv icon