Picture for Ping Luo

Ping Luo

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

Add code
Feb 23, 2024
Figure 1 for AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Figure 2 for AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Figure 3 for AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Figure 4 for AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Viaarxiv icon

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

Add code
Feb 22, 2024
Figure 1 for RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Figure 2 for RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Figure 3 for RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Figure 4 for RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Viaarxiv icon

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

Add code
Feb 18, 2024
Viaarxiv icon

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

Add code
Feb 14, 2024
Figure 1 for OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
Figure 2 for OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
Figure 3 for OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
Figure 4 for OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
Viaarxiv icon

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Add code
Jan 15, 2024
Viaarxiv icon

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Add code
Jan 10, 2024
Viaarxiv icon

ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning

Add code
Jan 10, 2024
Viaarxiv icon

LLaMA Pro: Progressive LLaMA with Block Expansion

Add code
Jan 04, 2024
Figure 1 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 2 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 3 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 4 for LLaMA Pro: Progressive LLaMA with Block Expansion
Viaarxiv icon

Video Understanding with Large Language Models: A Survey

Add code
Jan 04, 2024
Figure 1 for Video Understanding with Large Language Models: A Survey
Figure 2 for Video Understanding with Large Language Models: A Survey
Figure 3 for Video Understanding with Large Language Models: A Survey
Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Dec 26, 2023
Figure 1 for A Survey of Reasoning with Foundation Models
Figure 2 for A Survey of Reasoning with Foundation Models
Figure 3 for A Survey of Reasoning with Foundation Models
Figure 4 for A Survey of Reasoning with Foundation Models
Viaarxiv icon