Picture for Lu Yuan

Lu Yuan

Stephen

iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views

Add code
Dec 28, 2023
Viaarxiv icon

Learning Subject-Aware Cropping by Outpainting Professional Photos

Add code
Dec 19, 2023
Figure 1 for Learning Subject-Aware Cropping by Outpainting Professional Photos
Figure 2 for Learning Subject-Aware Cropping by Outpainting Professional Photos
Figure 3 for Learning Subject-Aware Cropping by Outpainting Professional Photos
Figure 4 for Learning Subject-Aware Cropping by Outpainting Professional Photos
Viaarxiv icon

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

Fully Authentic Visual Question Answering Dataset from Online Communities

Add code
Nov 27, 2023
Figure 1 for Fully Authentic Visual Question Answering Dataset from Online Communities
Figure 2 for Fully Authentic Visual Question Answering Dataset from Online Communities
Figure 3 for Fully Authentic Visual Question Answering Dataset from Online Communities
Figure 4 for Fully Authentic Visual Question Answering Dataset from Online Communities
Viaarxiv icon

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Add code
Nov 10, 2023
Figure 1 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 2 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 3 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 4 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Viaarxiv icon

PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders

Add code
Nov 08, 2023
Viaarxiv icon

On the Hidden Waves of Image

Add code
Oct 19, 2023
Figure 1 for On the Hidden Waves of Image
Figure 2 for On the Hidden Waves of Image
Figure 3 for On the Hidden Waves of Image
Figure 4 for On the Hidden Waves of Image
Viaarxiv icon

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

Add code
Oct 18, 2023
Figure 1 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 2 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 3 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 4 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Viaarxiv icon

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following

Add code
Oct 18, 2023
Viaarxiv icon

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Add code
Sep 21, 2023
Viaarxiv icon