Picture for Peng Li

Peng Li

DJI Innovations Inc

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

Add code
Jan 10, 2024
Figure 1 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 2 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 3 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 4 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Viaarxiv icon

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

Add code
Dec 25, 2023
Figure 1 for Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
Figure 2 for Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
Figure 3 for Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
Figure 4 for Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
Viaarxiv icon

Shai: A large language model for asset management

Add code
Dec 21, 2023
Viaarxiv icon

Cross-BERT for Point Cloud Pretraining

Add code
Dec 08, 2023
Figure 1 for Cross-BERT for Point Cloud Pretraining
Figure 2 for Cross-BERT for Point Cloud Pretraining
Figure 3 for Cross-BERT for Point Cloud Pretraining
Figure 4 for Cross-BERT for Point Cloud Pretraining
Viaarxiv icon

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

Add code
Nov 29, 2023
Figure 1 for Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Figure 2 for Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Figure 3 for Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Figure 4 for Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Viaarxiv icon

Topology-Preserving Adversarial Training

Add code
Nov 29, 2023
Figure 1 for Topology-Preserving Adversarial Training
Figure 2 for Topology-Preserving Adversarial Training
Figure 3 for Topology-Preserving Adversarial Training
Figure 4 for Topology-Preserving Adversarial Training
Viaarxiv icon

Adversarial Robust Memory-Based Continual Learner

Add code
Nov 29, 2023
Figure 1 for Adversarial Robust Memory-Based Continual Learner
Figure 2 for Adversarial Robust Memory-Based Continual Learner
Figure 3 for Adversarial Robust Memory-Based Continual Learner
Figure 4 for Adversarial Robust Memory-Based Continual Learner
Viaarxiv icon

Can Vision-Language Models Think from a First-Person Perspective?

Add code
Nov 27, 2023
Figure 1 for Can Vision-Language Models Think from a First-Person Perspective?
Figure 2 for Can Vision-Language Models Think from a First-Person Perspective?
Figure 3 for Can Vision-Language Models Think from a First-Person Perspective?
Figure 4 for Can Vision-Language Models Think from a First-Person Perspective?
Viaarxiv icon

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions

Add code
Nov 20, 2023
Figure 1 for Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Figure 2 for Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Figure 3 for Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Figure 4 for Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Viaarxiv icon

DISTA: Denoising Spiking Transformer with intrinsic plasticity and spatiotemporal attention

Add code
Nov 15, 2023
Viaarxiv icon