Picture for Mingcheng Li

Mingcheng Li

PersonaAnimator: Personalized Motion Transfer from Unconstrained Videos

Add code
Aug 27, 2025
Viaarxiv icon

COPO: Consistency-Aware Policy Optimization

Add code
Aug 06, 2025
Viaarxiv icon

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Add code
May 05, 2025
Viaarxiv icon

Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning

Add code
Nov 05, 2024
Figure 1 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 2 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 3 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 4 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Viaarxiv icon

MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration

Add code
Oct 17, 2024
Figure 1 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 2 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 3 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 4 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Viaarxiv icon

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators

Add code
Aug 22, 2024
Figure 1 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 2 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 3 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 4 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Viaarxiv icon

MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation

Add code
Aug 17, 2024
Figure 1 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 2 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 3 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 4 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Viaarxiv icon

HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction

Add code
Aug 17, 2024
Figure 1 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 2 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 3 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 4 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Viaarxiv icon

Faster Diffusion Action Segmentation

Add code
Aug 04, 2024
Figure 1 for Faster Diffusion Action Segmentation
Figure 2 for Faster Diffusion Action Segmentation
Figure 3 for Faster Diffusion Action Segmentation
Figure 4 for Faster Diffusion Action Segmentation
Viaarxiv icon

Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations

Add code
Jul 06, 2024
Figure 1 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 2 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 3 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 4 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Viaarxiv icon