Picture for Lin Song

Lin Song

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Add code
May 12, 2026
Viaarxiv icon

Thinking with Novel Views: A Systematic Analysis of Generative-Augmented Spatial Intelligence

Add code
May 11, 2026
Viaarxiv icon

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

Add code
May 05, 2026
Viaarxiv icon

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Add code
Apr 06, 2026
Viaarxiv icon

ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks

Add code
Aug 28, 2025
Figure 1 for ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks
Figure 2 for ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks
Figure 3 for ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks
Figure 4 for ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks
Viaarxiv icon

LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Add code
Jun 13, 2025
Figure 1 for LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Figure 2 for LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Figure 3 for LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Figure 4 for LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Viaarxiv icon

DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval

Add code
May 23, 2025
Viaarxiv icon

TensorAR: Refinement is All You Need in Autoregressive Image Generation

Add code
May 22, 2025
Figure 1 for TensorAR: Refinement is All You Need in Autoregressive Image Generation
Figure 2 for TensorAR: Refinement is All You Need in Autoregressive Image Generation
Figure 3 for TensorAR: Refinement is All You Need in Autoregressive Image Generation
Figure 4 for TensorAR: Refinement is All You Need in Autoregressive Image Generation
Viaarxiv icon

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Add code
May 19, 2025
Figure 1 for MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Figure 2 for MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Figure 3 for MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Figure 4 for MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Viaarxiv icon

YOLO-UniOW: Efficient Universal Open-World Object Detection

Add code
Dec 30, 2024
Figure 1 for YOLO-UniOW: Efficient Universal Open-World Object Detection
Figure 2 for YOLO-UniOW: Efficient Universal Open-World Object Detection
Figure 3 for YOLO-UniOW: Efficient Universal Open-World Object Detection
Figure 4 for YOLO-UniOW: Efficient Universal Open-World Object Detection
Viaarxiv icon