Picture for Mingcheng Li

Mingcheng Li

FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning

Add code
Dec 14, 2025
Viaarxiv icon

Improving Multimodal Sentiment Analysis via Modality Optimization and Dynamic Primary Modality Selection

Add code
Nov 14, 2025
Viaarxiv icon

PersonaAnimator: Personalized Motion Transfer from Unconstrained Videos

Add code
Aug 27, 2025
Viaarxiv icon

COPO: Consistency-Aware Policy Optimization

Add code
Aug 06, 2025
Viaarxiv icon

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Add code
May 05, 2025
Viaarxiv icon

Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning

Add code
Nov 05, 2024
Figure 1 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 2 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 3 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Figure 4 for Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
Viaarxiv icon

MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration

Add code
Oct 17, 2024
Figure 1 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 2 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 3 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 4 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Viaarxiv icon

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators

Add code
Aug 22, 2024
Figure 1 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 2 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 3 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Figure 4 for Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Viaarxiv icon

MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation

Add code
Aug 17, 2024
Figure 1 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 2 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 3 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Figure 4 for MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Viaarxiv icon

HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction

Add code
Aug 17, 2024
Figure 1 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 2 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 3 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Figure 4 for HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Viaarxiv icon