Picture for Xin He

Xin He

From Plausibility to Verifiability: Risk-Controlled Generative OCR for Vision-Language Models

Add code
Mar 20, 2026
Viaarxiv icon

ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents

Add code
Mar 20, 2026
Viaarxiv icon

VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models

Add code
Mar 19, 2026
Viaarxiv icon

RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks

Add code
Mar 12, 2026
Viaarxiv icon

Stochastic Discount Factors with Cross-Asset Spillovers

Add code
Feb 24, 2026
Viaarxiv icon

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Add code
Jan 27, 2026
Viaarxiv icon

FC-MIR: A Mobile Screen Awareness Framework for Intent-Aware Recommendation based on Frame-Compressed Multimodal Trajectory Reasoning

Add code
Dec 22, 2025
Viaarxiv icon

Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling

Add code
Nov 11, 2025
Viaarxiv icon

User Hesitation and Negative Transfer in Multi-Behavior Recommendation

Add code
Nov 08, 2025
Viaarxiv icon

Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion

Add code
Sep 19, 2025
Figure 1 for Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion
Figure 2 for Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion
Figure 3 for Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion
Figure 4 for Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion
Viaarxiv icon