Picture for Peng Xu

Peng Xu

Google

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Add code
Apr 24, 2024
Figure 1 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 2 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 3 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 4 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Viaarxiv icon

LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion

Add code
Apr 07, 2024
Figure 1 for LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Figure 2 for LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Figure 3 for LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Figure 4 for LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Viaarxiv icon

CACA Agent: Capability Collaboration based AI Agent

Add code
Mar 22, 2024
Viaarxiv icon

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

Add code
Mar 16, 2024
Viaarxiv icon

Learning-driven Physically-aware Large-scale Circuit Gate Sizing

Add code
Mar 13, 2024
Figure 1 for Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Figure 2 for Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Figure 3 for Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Figure 4 for Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Viaarxiv icon

RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches

Add code
Mar 05, 2024
Figure 1 for RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Figure 2 for RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Figure 3 for RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Figure 4 for RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Viaarxiv icon

IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding

Add code
Feb 28, 2024
Figure 1 for IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding
Figure 2 for IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding
Figure 3 for IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding
Figure 4 for IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding
Viaarxiv icon

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Add code
Feb 18, 2024
Figure 1 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 2 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 3 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 4 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Viaarxiv icon

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

Add code
Feb 18, 2024
Viaarxiv icon

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Add code
Feb 12, 2024
Figure 1 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 2 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 3 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 4 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Viaarxiv icon