Siliang Tang

STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training

Nov 29, 2024

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Nov 24, 2024

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Nov 01, 2024

GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs

Oct 15, 2024

RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection

Oct 02, 2024

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Sep 30, 2024

Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation

Sep 27, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

Aug 19, 2024

E-CGL: An Efficient Continual Graph Learner

Aug 18, 2024

Graph Retrieval-Augmented Generation: A Survey

Aug 15, 2024