Picture for Boyi Li

Boyi Li

Describe Anything: Detailed Localized Image and Video Captioning

Add code
Apr 22, 2025
Viaarxiv icon

Optimization of MedSAM model based on bounding box adaptive perturbation algorithm

Add code
Mar 25, 2025
Viaarxiv icon

Scaling Vision Pre-Training to 4K Resolution

Add code
Mar 25, 2025
Viaarxiv icon

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Add code
Mar 16, 2025
Viaarxiv icon

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Add code
Jan 29, 2025
Viaarxiv icon

DreamDrive: Generative 4D Scene Modeling from Street View Images

Add code
Jan 03, 2025
Figure 1 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 2 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 3 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 4 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Viaarxiv icon

STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes

Add code
Dec 31, 2024
Figure 1 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 2 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 3 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 4 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Viaarxiv icon

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models

Add code
Dec 10, 2024
Figure 1 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 2 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 3 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 4 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Viaarxiv icon

Extrapolated Urban View Synthesis Benchmark

Add code
Dec 10, 2024
Figure 1 for Extrapolated Urban View Synthesis Benchmark
Figure 2 for Extrapolated Urban View Synthesis Benchmark
Figure 3 for Extrapolated Urban View Synthesis Benchmark
Figure 4 for Extrapolated Urban View Synthesis Benchmark
Viaarxiv icon

Promptable Closed-loop Traffic Simulation

Add code
Sep 09, 2024
Figure 1 for Promptable Closed-loop Traffic Simulation
Figure 2 for Promptable Closed-loop Traffic Simulation
Figure 3 for Promptable Closed-loop Traffic Simulation
Viaarxiv icon