Picture for Xu Zhang

Xu Zhang

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Add code
Jan 07, 2026
Viaarxiv icon

ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration

Add code
Jan 06, 2026
Viaarxiv icon

DynaFix: Iterative Automated Program Repair Driven by Execution-Level Dynamic Information

Add code
Dec 31, 2025
Viaarxiv icon

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Add code
Dec 26, 2025
Viaarxiv icon

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Add code
Dec 22, 2025
Figure 1 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 2 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 3 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 4 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Viaarxiv icon

Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing

Add code
Dec 22, 2025
Figure 1 for Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing
Figure 2 for Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing
Figure 3 for Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing
Figure 4 for Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing
Viaarxiv icon

Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing

Add code
Dec 12, 2025
Viaarxiv icon

Multi-period Learning for Financial Time Series Forecasting

Add code
Nov 07, 2025
Figure 1 for Multi-period Learning for Financial Time Series Forecasting
Figure 2 for Multi-period Learning for Financial Time Series Forecasting
Figure 3 for Multi-period Learning for Financial Time Series Forecasting
Figure 4 for Multi-period Learning for Financial Time Series Forecasting
Viaarxiv icon

Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification

Add code
Nov 07, 2025
Figure 1 for Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification
Figure 2 for Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification
Figure 3 for Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification
Figure 4 for Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification
Viaarxiv icon

BoRe-Depth: Self-supervised Monocular Depth Estimation with Boundary Refinement for Embedded Systems

Add code
Nov 06, 2025
Viaarxiv icon