Picture for Li Zhu

Li Zhu

MultiDx: A Multi-Source Knowledge Integration Framework towards Diagnostic Reasoning

Add code
Apr 27, 2026
Viaarxiv icon

AdapTime: Enabling Adaptive Temporal Reasoning in Large Language Models

Add code
Apr 27, 2026
Viaarxiv icon

Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation

Add code
Apr 02, 2026
Viaarxiv icon

Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting

Add code
Mar 29, 2026
Viaarxiv icon

Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection

Add code
Mar 02, 2026
Viaarxiv icon

Semantic-Deviation-Anchored Multi-Branch Fusion for Unsupervised Anomaly Detection and Localization in Unstructured Conveyor-Belt Coal Scenes

Add code
Feb 07, 2026
Viaarxiv icon

Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation

Add code
Jan 29, 2026
Viaarxiv icon

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

Add code
Jan 18, 2026
Viaarxiv icon

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Add code
Jan 18, 2026
Viaarxiv icon

Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI

Add code
Jul 16, 2025
Figure 1 for Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI
Figure 2 for Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI
Figure 3 for Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI
Figure 4 for Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI
Viaarxiv icon