Picture for Zhendong Mao

Zhendong Mao

Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models

Add code
May 06, 2026
Viaarxiv icon

Stream-T1: Test-Time Scaling for Streaming Video Generation

Add code
May 06, 2026
Viaarxiv icon

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Add code
May 05, 2026
Viaarxiv icon

CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers

Add code
Apr 21, 2026
Viaarxiv icon

A Multi-Agent Framework with Structured Reasoning and Reflective Refinement for Multimodal Empathetic Response Generation

Add code
Apr 21, 2026
Viaarxiv icon

FACE-net: Factual Calibration and Emotion Augmentation for Retrieval-enhanced Emotional Video Captioning

Add code
Mar 18, 2026
Viaarxiv icon

CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling

Add code
Mar 09, 2026
Viaarxiv icon

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Add code
Feb 03, 2026
Viaarxiv icon

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Add code
Feb 03, 2026
Viaarxiv icon

Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles

Add code
Feb 03, 2026
Viaarxiv icon