Picture for Jianlong Fu

Jianlong Fu

A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation

Add code
Oct 19, 2021
Figure 1 for A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation
Figure 2 for A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation
Figure 3 for A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation
Viaarxiv icon

Learning Fine-Grained Motion Embedding for Landscape Animation

Add code
Sep 13, 2021
Figure 1 for Learning Fine-Grained Motion Embedding for Landscape Animation
Figure 2 for Learning Fine-Grained Motion Embedding for Landscape Animation
Figure 3 for Learning Fine-Grained Motion Embedding for Landscape Animation
Figure 4 for Learning Fine-Grained Motion Embedding for Landscape Animation
Viaarxiv icon

Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment

Add code
Aug 18, 2021
Figure 1 for Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
Figure 2 for Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
Figure 3 for Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
Figure 4 for Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
Viaarxiv icon

Domain-Aware Universal Style Transfer

Add code
Aug 17, 2021
Figure 1 for Domain-Aware Universal Style Transfer
Figure 2 for Domain-Aware Universal Style Transfer
Figure 3 for Domain-Aware Universal Style Transfer
Figure 4 for Domain-Aware Universal Style Transfer
Viaarxiv icon

Reference-based Defect Detection Network

Add code
Aug 10, 2021
Figure 1 for Reference-based Defect Detection Network
Figure 2 for Reference-based Defect Detection Network
Figure 3 for Reference-based Defect Detection Network
Figure 4 for Reference-based Defect Detection Network
Viaarxiv icon

Rethinking and Improving Relative Position Encoding for Vision Transformer

Add code
Jul 29, 2021
Figure 1 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 2 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 3 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 4 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Viaarxiv icon

AutoFormer: Searching Transformers for Visual Recognition

Add code
Jul 01, 2021
Figure 1 for AutoFormer: Searching Transformers for Visual Recognition
Figure 2 for AutoFormer: Searching Transformers for Visual Recognition
Figure 3 for AutoFormer: Searching Transformers for Visual Recognition
Figure 4 for AutoFormer: Searching Transformers for Visual Recognition
Viaarxiv icon

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training

Add code
Jun 28, 2021
Figure 1 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 2 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 3 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 4 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Viaarxiv icon

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search

Add code
Apr 29, 2021
Figure 1 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 2 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 3 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 4 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Viaarxiv icon

Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Add code
Apr 08, 2021
Figure 1 for Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Figure 2 for Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Figure 3 for Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Figure 4 for Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Viaarxiv icon