Jiuxiang Gu

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Dec 13, 2024

Personalized Multimodal Large Language Models: A Survey

Dec 03, 2024

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation

Dec 02, 2024

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Nov 02, 2024

Personalization of Large Language Models: A Survey

Oct 29, 2024

A Survey of Small Language Models

Oct 25, 2024

VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use

Oct 21, 2024

TextLap: Customizing Language Models for Text-to-Layout Planning

Oct 09, 2024

ImageFolder: Autoregressive Image Generation with Folded Tokens

Oct 02, 2024

A Multi-LLM Debiasing Framework

Sep 20, 2024