Bin Wang

and Other Contributors

AgentVLN: Towards Agentic Vision-and-Language Navigation

Mar 18, 2026
Via arXiv

Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing

Mar 17, 2026
Via arXiv

DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation

Mar 13, 2026
Via arXiv

Mango-GS: Enhancing Spatio-Temporal Consistency in Dynamic Scenes Reconstruction using Multi-Frame Node-Guided 4D Gaussian Splatting

Mar 12, 2026
Via arXiv

World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models

Mar 10, 2026
Via arXiv

TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration

Mar 04, 2026
Via arXiv

ShiftLUT: Spatial Shift Enhanced Look-Up Tables for Efficient Image Restoration

Mar 03, 2026
Via arXiv

AgenticOCR: Parsing Only What You Need for Efficient Retrieval-Augmented Generation

Feb 27, 2026
Via arXiv

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Feb 26, 2026
Via arXiv

MoDora: Tree-Based Semi-Structured Document Analysis System

Feb 26, 2026
Via arXiv