Picture for Siliang Tang

Siliang Tang

Logic Distillation: Learning from Code Function by Function for Planning and Decision-making

Add code
Jul 28, 2024
Figure 1 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 2 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 3 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Figure 4 for Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Viaarxiv icon

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization

Add code
Jul 15, 2024
Figure 1 for IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization
Figure 2 for IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization
Figure 3 for IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization
Figure 4 for IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization
Viaarxiv icon

Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference

Add code
Jul 06, 2024
Figure 1 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 2 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 3 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 4 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Viaarxiv icon

Bridging Local Details and Global Context in Text-Attributed Graphs

Add code
Jun 18, 2024
Viaarxiv icon

Improving Large Models with Small models: Lower Costs and Better Performance

Add code
Jun 15, 2024
Viaarxiv icon

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

Add code
Jun 11, 2024
Viaarxiv icon

Auto-Encoding Morph-Tokens for Multimodal LLM

Add code
May 03, 2024
Viaarxiv icon

WorldGPT: Empowering LLM as Multimodal World Model

Add code
Apr 28, 2024
Figure 1 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 2 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 3 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 4 for WorldGPT: Empowering LLM as Multimodal World Model
Viaarxiv icon

LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation

Add code
Apr 23, 2024
Viaarxiv icon

Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

Add code
Apr 17, 2024
Figure 1 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 2 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 3 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 4 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Viaarxiv icon