Jingyi Zhang

OpenClaw-Skill: Collective Skill Tree Search for Agentic Large Language Models

Jun 15, 2026
Via arXiv

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

Apr 21, 2026
Via arXiv

HyperLiDAR: Adaptive Post-Deployment LiDAR Segmentation via Hyperdimensional Computing

Apr 14, 2026
Via arXiv

Reinforcing Structured Chain-of-Thought for Video Understanding

Mar 26, 2026
Via arXiv

KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph

Mar 22, 2026
Via arXiv

SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime

Mar 19, 2026
Via arXiv

Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos

Mar 10, 2026
Via arXiv

MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline

Mar 01, 2026
Via arXiv

R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?

Feb 03, 2026
Via arXiv

STILL: Selecting Tokens for Intra-Layer Hybrid Attention to Linearize LLMs

Feb 02, 2026
Via arXiv