Jing Liu

Perry

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs

Dec 19, 2025
Via arXiv

Tracking large chemical reaction networks and rare events by neural networks

Dec 13, 2025
Via arXiv

Untethered thin dielectric elastomer actuated soft robot

Dec 12, 2025
Via arXiv

UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories

Dec 10, 2025
Via arXiv

Transferable Dual-Domain Feature Importance Attack against AI-Generated Image Detector

Nov 19, 2025
Via arXiv

ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation

Nov 18, 2025
Via arXiv

OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs

Nov 18, 2025
Via arXiv

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Nov 17, 2025
Via arXiv

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection

Nov 17, 2025
Via arXiv

DLMMPR: Deep Learning-based Measurement Matrix for Phase Retrieval

Nov 16, 2025
Via arXiv