Picture for Jingjing Zhang

Jingjing Zhang

Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video

Add code
Mar 14, 2026
Viaarxiv icon

Evaluating VLMs' Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences

Add code
Mar 13, 2026
Viaarxiv icon

Retrieving Patient-Specific Radiomic Feature Sets for Transparent Knee MRI Assessment

Add code
Mar 02, 2026
Viaarxiv icon

InternVLA-A1: Unifying Understanding, Generation and Action for Robotic Manipulation

Add code
Jan 05, 2026
Viaarxiv icon

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment

Add code
Nov 06, 2025
Viaarxiv icon

DMF2Mel: A Dynamic Multiscale Fusion Network for EEG-Driven Mel Spectrogram Reconstruction

Add code
Jul 10, 2025
Viaarxiv icon

MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection

Add code
May 21, 2025
Figure 1 for MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Figure 2 for MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Figure 3 for MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Figure 4 for MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Viaarxiv icon

SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework

Add code
Apr 18, 2025
Figure 1 for SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework
Figure 2 for SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework
Figure 3 for SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework
Figure 4 for SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework
Viaarxiv icon

Composed Multi-modal Retrieval: A Survey of Approaches and Applications

Add code
Mar 03, 2025
Figure 1 for Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Figure 2 for Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Figure 3 for Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Figure 4 for Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Viaarxiv icon

Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction

Add code
Jan 03, 2025
Figure 1 for Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction
Figure 2 for Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction
Figure 3 for Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction
Figure 4 for Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction
Viaarxiv icon