Quanjun Yin

Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events

May 22, 2025
Via arXiv

Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology

May 14, 2025
Via arXiv

GeoNav: Empowering MLLMs with Explicit Geospatial Reasoning Abilities for Language-Goal Aerial Navigation

Apr 13, 2025
Via arXiv

SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding

Feb 24, 2025
Via arXiv

Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model

Nov 16, 2024
Via arXiv

Boosting the Performance of Decentralized Federated Learning via Catalyst Acceleration

Oct 09, 2024
Via arXiv

OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement

Oct 09, 2024
Via arXiv

M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension

Jul 01, 2024
Via arXiv

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

May 23, 2024
Via arXiv

DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

May 10, 2024
Via arXiv