Picture for Xiaodan Liang

Xiaodan Liang

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Add code
Oct 12, 2023
Figure 1 for LEGO-Prover: Neural Theorem Proving with Growing Libraries
Figure 2 for LEGO-Prover: Neural Theorem Proving with Growing Libraries
Figure 3 for LEGO-Prover: Neural Theorem Proving with Growing Libraries
Figure 4 for LEGO-Prover: Neural Theorem Proving with Growing Libraries
Viaarxiv icon

Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images

Add code
Aug 31, 2023
Figure 1 for Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Figure 2 for Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Figure 3 for Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Figure 4 for Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Viaarxiv icon

DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment

Add code
Aug 22, 2023
Viaarxiv icon

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training

Add code
Aug 22, 2023
Figure 1 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 2 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 3 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 4 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Viaarxiv icon

Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos

Add code
Aug 20, 2023
Figure 1 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 2 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 3 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 4 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Viaarxiv icon

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Add code
Aug 18, 2023
Viaarxiv icon

CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation

Add code
Aug 14, 2023
Figure 1 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 2 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 3 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 4 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Viaarxiv icon

LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts

Add code
Aug 13, 2023
Figure 1 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 2 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 3 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 4 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Viaarxiv icon

MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation

Add code
Aug 09, 2023
Figure 1 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 2 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 3 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 4 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Viaarxiv icon

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Add code
Jul 31, 2023
Figure 1 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 2 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 3 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 4 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Viaarxiv icon