Picture for Wei Ji

Wei Ji

Towards Unified Surgical Scene Understanding:Bridging Reasoning and Grounding via MLLMs

Add code
May 13, 2026
Viaarxiv icon

RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation

Add code
May 11, 2026
Viaarxiv icon

TexEditor: Structure-Preserving Text-Driven Texture Editing

Add code
Mar 19, 2026
Viaarxiv icon

Selective Noise Suppression and Discriminative Mutual Interaction for Robust Audio-Visual Segmentation

Add code
Mar 15, 2026
Viaarxiv icon

UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark

Add code
Mar 05, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

Interp3D: Correspondence-aware Interpolation for Generative Textured 3D Morphing

Add code
Jan 20, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation

Add code
Dec 27, 2025
Viaarxiv icon

Step-DeepResearch Technical Report

Add code
Dec 24, 2025
Viaarxiv icon