Shu Wang

MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents

Apr 06, 2026
Via arXiv

LLM+Graph@VLDB'2025 Workshop Summary

Apr 03, 2026
Via arXiv

Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement

Mar 30, 2026
Via arXiv

VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention

Feb 23, 2026
Via arXiv

Integrated Exploration and Sequential Manipulation on Scene Graph with LLM-based Situated Replanning

Feb 04, 2026
Via arXiv

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Dec 10, 2025
Via arXiv

Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining

May 22, 2025
Via arXiv

ChainMarks: Securing DNN Watermark with Cryptographic Chain

May 08, 2025
Via arXiv

R^3-VQA: "Read the Room" by Video Social Reasoning

May 07, 2025
Via arXiv

DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification

Apr 15, 2025
Via arXiv