Hao Tian

Sichuan University

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision

Nov 18, 2022
Via arXiv

Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models

Nov 14, 2022
Via arXiv

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation

Nov 14, 2022
Via arXiv

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Nov 09, 2022
Via arXiv

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

Oct 27, 2022
Via arXiv

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

Oct 14, 2022
Via arXiv

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

Sep 30, 2022
Via arXiv

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Sep 12, 2022
Via arXiv

Graph Kernels Based on Multi-scale Graph Embeddings

Jun 02, 2022
Via arXiv

ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval

May 18, 2022
Via arXiv