Ziqi Zhang

Peking University

STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs

May 26, 2025

DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval

May 23, 2025

STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs

May 21, 2025

Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous Models

Mar 13, 2025

Is FISHER All You Need in The Multi-AUV Underwater Target Tracking Task?

Dec 05, 2024

Enhancing Recommendation Systems with GNNs and Addressing Over-Smoothing

Dec 04, 2024

An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction

Dec 03, 2024

RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

Nov 27, 2024

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Nov 22, 2024

TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models

Nov 15, 2024