Picture for Ping Luo

Ping Luo

ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning

Add code
Jan 10, 2024
Viaarxiv icon

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Add code
Jan 10, 2024
Viaarxiv icon

LLaMA Pro: Progressive LLaMA with Block Expansion

Add code
Jan 04, 2024
Figure 1 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 2 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 3 for LLaMA Pro: Progressive LLaMA with Block Expansion
Figure 4 for LLaMA Pro: Progressive LLaMA with Block Expansion
Viaarxiv icon

Video Understanding with Large Language Models: A Survey

Add code
Jan 04, 2024
Figure 1 for Video Understanding with Large Language Models: A Survey
Figure 2 for Video Understanding with Large Language Models: A Survey
Figure 3 for Video Understanding with Large Language Models: A Survey
Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Dec 26, 2023
Figure 1 for A Survey of Reasoning with Foundation Models
Figure 2 for A Survey of Reasoning with Foundation Models
Figure 3 for A Survey of Reasoning with Foundation Models
Figure 4 for A Survey of Reasoning with Foundation Models
Viaarxiv icon

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

Add code
Dec 25, 2023
Viaarxiv icon

DriveLM: Driving with Graph Visual Question Answering

Add code
Dec 21, 2023
Figure 1 for DriveLM: Driving with Graph Visual Question Answering
Figure 2 for DriveLM: Driving with Graph Visual Question Answering
Figure 3 for DriveLM: Driving with Graph Visual Question Answering
Figure 4 for DriveLM: Driving with Graph Visual Question Answering
Viaarxiv icon

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Add code
Dec 20, 2023
Viaarxiv icon

SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution

Add code
Dec 18, 2023
Figure 1 for SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Figure 2 for SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Figure 3 for SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Figure 4 for SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Viaarxiv icon

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

Add code
Dec 09, 2023
Viaarxiv icon