Ming Li

School of Integrated Circuits, Peking University

CLS-RL: Image Classification with Rule-Based Reinforcement Learning

Mar 20, 2025

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Mar 19, 2025

Where do Large Vision-Language Models Look at when Answering Questions?

Mar 18, 2025

Joint Array Partitioning and Beamforming Designs in ISAC Systems: A Bayesian CRB Perspective

Mar 18, 2025

Low Range-Doppler Sidelobe ISAC Waveform Design: A Low-Complexity Approach

Mar 15, 2025

Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking

Mar 14, 2025

FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models

Mar 13, 2025

Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Mar 11, 2025

AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis

Mar 11, 2025

Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models

Mar 11, 2025