Jiaqi Wang

Michael Pokorny

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Oct 31, 2025
Via arXiv

Whole-Body Proprioceptive Morphing: A Modular Soft Gripper for Robust Cross-Scale Grasping

Oct 31, 2025
Via arXiv

CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark

Oct 30, 2025
Via arXiv

The Universal Landscape of Human Reasoning

Oct 24, 2025
Via arXiv

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Oct 02, 2025
Via arXiv

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Sep 26, 2025
Via arXiv

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Sep 26, 2025
Via arXiv

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Sep 26, 2025
Via arXiv

ConvergeWriter: Data-Driven Bottom-Up Article Construction

Sep 16, 2025
Via arXiv

LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning

Sep 16, 2025
Via arXiv