Picture for Yueting Zhuang

Yueting Zhuang

Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness

Add code
Dec 09, 2024
Figure 1 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 2 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 3 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 4 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Viaarxiv icon

STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training

Add code
Nov 29, 2024
Viaarxiv icon

Spatially Visual Perception for End-to-End Robotic Learning

Add code
Nov 26, 2024
Figure 1 for Spatially Visual Perception for End-to-End Robotic Learning
Figure 2 for Spatially Visual Perception for End-to-End Robotic Learning
Figure 3 for Spatially Visual Perception for End-to-End Robotic Learning
Figure 4 for Spatially Visual Perception for End-to-End Robotic Learning
Viaarxiv icon

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Add code
Nov 24, 2024
Figure 1 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 2 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 3 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 4 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Viaarxiv icon

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models

Add code
Nov 14, 2024
Figure 1 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 2 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 3 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 4 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Viaarxiv icon

GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation

Add code
Oct 15, 2024
Figure 1 for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
Figure 2 for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
Figure 3 for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
Figure 4 for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
Viaarxiv icon

RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection

Add code
Oct 02, 2024
Viaarxiv icon

Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation

Add code
Sep 27, 2024
Figure 1 for Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
Figure 2 for Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
Figure 3 for Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
Figure 4 for Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
Viaarxiv icon

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

Add code
Aug 19, 2024
Figure 1 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 2 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 3 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 4 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Viaarxiv icon

Logic Distillation: Learning from Code Function by Function for Planning and Decision-making

Add code
Jul 28, 2024
Figure 1 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 2 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 3 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 4 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Viaarxiv icon