Picture for Ping Luo

Ping Luo

HRVMamba: High-Resolution Visual State Space Model for Dense Prediction

Add code
Oct 04, 2024
Viaarxiv icon

Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking

Add code
Sep 24, 2024
Viaarxiv icon

Prior Knowledge Distillation Network for Face Super-Resolution

Add code
Sep 22, 2024
Figure 1 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 2 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 3 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 4 for Prior Knowledge Distillation Network for Face Super-Resolution
Viaarxiv icon

Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation

Add code
Sep 13, 2024
Figure 1 for Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
Figure 2 for Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
Figure 3 for Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
Figure 4 for Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
Viaarxiv icon

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)

Add code
Sep 04, 2024
Figure 1 for RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)
Figure 2 for RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)
Figure 3 for RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)
Figure 4 for RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)
Viaarxiv icon

Federated Prediction-Powered Inference from Decentralized Data

Add code
Sep 03, 2024
Figure 1 for Federated Prediction-Powered Inference from Decentralized Data
Figure 2 for Federated Prediction-Powered Inference from Decentralized Data
Figure 3 for Federated Prediction-Powered Inference from Decentralized Data
Figure 4 for Federated Prediction-Powered Inference from Decentralized Data
Viaarxiv icon

Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing

Add code
Aug 23, 2024
Viaarxiv icon

HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Add code
Aug 18, 2024
Viaarxiv icon

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Add code
Aug 05, 2024
Figure 1 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 2 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 3 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 4 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Viaarxiv icon

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

Add code
Aug 01, 2024
Viaarxiv icon