Ping Luo

ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation

Mar 11, 2024

Towards Implicit Prompt For Text-To-Image Models

Mar 08, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Mar 07, 2024

RegionGPT: Towards Region Understanding Vision Language Model

Mar 04, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Feb 25, 2024

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

Feb 23, 2024

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

Feb 22, 2024

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

Feb 18, 2024

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

Feb 14, 2024

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Jan 15, 2024