Picture for Qi Zhang

Qi Zhang

School of Information, North China University of Technology

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction

Add code
Oct 16, 2025
Viaarxiv icon

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

Add code
Oct 01, 2025
Viaarxiv icon

Query-Kontext: An Unified Multimodal Model for Image Generation and Editing

Add code
Sep 30, 2025
Figure 1 for Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Figure 2 for Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Figure 3 for Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Figure 4 for Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Viaarxiv icon

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

Add code
Sep 26, 2025
Viaarxiv icon

TASAM: Terrain-and-Aware Segment Anything Model for Temporal-Scale Remote Sensing Segmentation

Add code
Sep 19, 2025
Viaarxiv icon

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Add code
Sep 19, 2025
Viaarxiv icon

Hint: hierarchical inter-frame correlation for one-shot point cloud sequence compression

Add code
Sep 18, 2025
Viaarxiv icon

CUFG: Curriculum Unlearning Guided by the Forgetting Gradient

Add code
Sep 18, 2025
Viaarxiv icon

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Add code
Sep 10, 2025
Viaarxiv icon

TinySR: Pruning Diffusion for Real-World Image Super-Resolution

Add code
Aug 24, 2025
Viaarxiv icon