Picture for Yueting Zhuang

Yueting Zhuang

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization

Add code
Jul 15, 2024
Viaarxiv icon

From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation

Add code
Jul 12, 2024
Viaarxiv icon

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Add code
Jul 10, 2024
Viaarxiv icon

Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference

Add code
Jul 06, 2024
Viaarxiv icon

Bridging Local Details and Global Context in Text-Attributed Graphs

Add code
Jun 18, 2024
Viaarxiv icon

Improving Large Models with Small models: Lower Costs and Better Performance

Add code
Jun 15, 2024
Viaarxiv icon

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

Add code
Jun 11, 2024
Figure 1 for T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
Figure 2 for T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
Figure 3 for T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
Figure 4 for T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
Viaarxiv icon

Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism

Add code
Jun 06, 2024
Figure 1 for Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism
Figure 2 for Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism
Figure 3 for Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism
Figure 4 for Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism
Viaarxiv icon

Auto-Encoding Morph-Tokens for Multimodal LLM

Add code
May 03, 2024
Viaarxiv icon

WorldGPT: Empowering LLM as Multimodal World Model

Add code
Apr 28, 2024
Figure 1 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 2 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 3 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 4 for WorldGPT: Empowering LLM as Multimodal World Model
Viaarxiv icon