Xiaodan Liang

ATG: Benchmarking Automated Theorem Generation for Generative Language Models

May 05, 2024

MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation

May 01, 2024

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

Apr 29, 2024

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Apr 25, 2024

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

Apr 14, 2024

MLP Can Be A Good Transformer Learner

Apr 08, 2024

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Mar 18, 2024

DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

Mar 13, 2024

Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation

Mar 13, 2024

NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

Mar 12, 2024