Jun Peng

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Jun 11, 2025

Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis

Apr 14, 2025

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection

Feb 08, 2025

TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning

Dec 11, 2024

Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion

Jun 28, 2024

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Mar 11, 2024

Towards Training A Chinese Large Language Model for Anesthesiology

Mar 05, 2024