Peng Shi

MobileDreamer: Generative Sketch World Model for GUI Agent

Jan 07, 2026

Learning When to Look: A Disentangled Curriculum for Strategic Perception in Multimodal Reasoning

Dec 19, 2025

FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration

Dec 12, 2025

Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning

Oct 23, 2025

Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning

Jun 16, 2025

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Apr 29, 2025

Discrimination-free Insurance Pricing with Privatized Sensitive Attributes

Apr 16, 2025

You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Sep 18, 2024

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Aug 15, 2024

Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning

Nov 07, 2023