Qi Song

Towards Long-window Anchoring in Vision-Language Model Distillation

Dec 25, 2025
via arXiv

CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning

Dec 19, 2025
via arXiv

GraphIF: Enhancing Multi-Turn Instruction Following for Large Language Models with Relation Graph Prompt

Nov 13, 2025
via arXiv

Align 3D Representation and Text Embedding for 3D Content Personalization

Aug 23, 2025
via arXiv

CrossLinear: Plug-and-Play Cross-Correlation Embedding for Time Series Forecasting with Exogenous Variables

May 29, 2025
via arXiv

Advanced long-term earth system forecasting by learning the small-scale nature

May 26, 2025
via arXiv

ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs

Apr 01, 2025
via arXiv

Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching

Feb 28, 2025
via arXiv

KnowPath: Knowledge-enhanced Reasoning via LLM-generated Inference Paths over Knowledge Graphs

Feb 17, 2025
via arXiv

PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression

Feb 07, 2025
via arXiv