Picture for Ming Zhang

Ming Zhang

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Add code
Nov 06, 2025
Viaarxiv icon

A Survey on Efficient Large Language Model Training: From Data-centric Perspectives

Add code
Oct 29, 2025
Viaarxiv icon

Automated Genomic Interpretation via Concept Bottleneck Models for Medical Robotics

Add code
Oct 02, 2025
Viaarxiv icon

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

Add code
Oct 01, 2025
Viaarxiv icon

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

Add code
Sep 26, 2025
Viaarxiv icon

Type-Compliant Adaptation Cascades: Adapting Programmatic LM Workflows to Data

Add code
Aug 25, 2025
Viaarxiv icon

LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

Physical Adversarial Camouflage through Gradient Calibration and Regularization

Add code
Aug 07, 2025
Viaarxiv icon

An Explainable Emotion Alignment Framework for LLM-Empowered Agent in Metaverse Service Ecosystem

Add code
Jul 30, 2025
Figure 1 for An Explainable Emotion Alignment Framework for LLM-Empowered Agent in Metaverse Service Ecosystem
Figure 2 for An Explainable Emotion Alignment Framework for LLM-Empowered Agent in Metaverse Service Ecosystem
Figure 3 for An Explainable Emotion Alignment Framework for LLM-Empowered Agent in Metaverse Service Ecosystem
Figure 4 for An Explainable Emotion Alignment Framework for LLM-Empowered Agent in Metaverse Service Ecosystem
Viaarxiv icon

Sparse Causal Discovery with Generative Intervention for Unsupervised Graph Domain Adaptation

Add code
Jul 10, 2025
Viaarxiv icon