Kun Ding

Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding

Mar 11, 2026

SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation

Mar 02, 2026

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering

Feb 27, 2026

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

Feb 22, 2026

Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions

Feb 10, 2026

DSFC-Net: A Dual-Encoder Spatial and Frequency Co-Awareness Network for Rural Road Extraction

Feb 01, 2026

PDE-Agent: A toolchain-augmented multi-agent framework for PDE solving

Dec 22, 2025

Fault Diagnosis and Quantification for Photovoltaic Arrays based on Differentiable Physical Models

Dec 18, 2025

IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting

Dec 10, 2025

Re-ranking Reasoning Context with Tree Search Makes Large Vision-Language Models Stronger

Jun 09, 2025