Jie Tang

Tony

Action Draft and Verify: A Self-Verifying Framework for Vision-Language-Action Model

Mar 18, 2026
Via arXiv

Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection

Mar 17, 2026
Via arXiv

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Mar 12, 2026
Via arXiv

GLM-OCR Technical Report

Mar 11, 2026
Via arXiv

SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization

Mar 10, 2026
Via arXiv

TraceSIR: A Multi-Agent Framework for Structured Analysis and Reporting of Agentic Execution Traces

Feb 28, 2026
Via arXiv

GLM-5: from Vibe Coding to Agentic Engineering

Feb 17, 2026
Via arXiv

GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates

Jan 31, 2026
Via arXiv

DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion

Jan 30, 2026
Via arXiv

Gaussian Belief Propagation Network for Depth Completion

Jan 29, 2026
Via arXiv