Picture for Pan Zhou

Pan Zhou

The Hubei Engineering Research Center on Big Data Security, School of Cyber Science and Engineering, Huazhong University of Science and Technology

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Add code
May 29, 2025
Viaarxiv icon

Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM

Add code
May 29, 2025
Viaarxiv icon

Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios

Add code
May 23, 2025
Viaarxiv icon

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Add code
May 22, 2025
Viaarxiv icon

BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization

Add code
May 22, 2025
Viaarxiv icon

ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment

Add code
May 08, 2025
Viaarxiv icon

Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes

Add code
May 03, 2025
Viaarxiv icon

Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities

Add code
Mar 14, 2025
Viaarxiv icon

DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image

Add code
Mar 13, 2025
Viaarxiv icon

A Survey on Post-training of Large Language Models

Add code
Mar 08, 2025
Viaarxiv icon