Picture for Pei Zhou

Pei Zhou

School of Optoelectronic Science and Engineering and Collaborative Innovation Center of Suzhou Nano Science and Technology, Soochow University, Suzhou 215006, China, Key Lab of Advanced Optical Manufacturing Technologies of Jiangsu Province and Key Lab of Modern Optical Technologies of Education Ministry of China, Soochow University, Suzhou 215006, China, Key Laboratory of Radar Imaging and Microwave Photonics, Ministry of Education, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

One Model, All Roles: Multi-Turn, Multi-Agent Self-Play Reinforcement Learning for Conversational Social Intelligence

Add code
Feb 03, 2026
Viaarxiv icon

Beyond Output Critique: Self-Correction via Task Distillation

Add code
Jan 31, 2026
Viaarxiv icon

Spatial4D-Bench: A Versatile 4D Spatial Intelligence Benchmark

Add code
Dec 31, 2025
Viaarxiv icon

Learning Human-Humanoid Coordination for Collaborative Object Carrying

Add code
Oct 16, 2025
Viaarxiv icon

Skeleton-based sign language recognition using a dual-stream spatio-temporal dynamic graph convolutional network

Add code
Sep 10, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Image Denoising Challenge Report

Add code
Apr 16, 2025
Figure 1 for The Tenth NTIRE 2025 Image Denoising Challenge Report
Figure 2 for The Tenth NTIRE 2025 Image Denoising Challenge Report
Figure 3 for The Tenth NTIRE 2025 Image Denoising Challenge Report
Figure 4 for The Tenth NTIRE 2025 Image Denoising Challenge Report
Viaarxiv icon

GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation

Add code
Feb 26, 2025
Viaarxiv icon

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback

Add code
Aug 28, 2024
Figure 1 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 2 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 3 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 4 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Viaarxiv icon

MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery

Add code
Jul 21, 2024
Figure 1 for MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Figure 2 for MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Figure 3 for MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Figure 4 for MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Viaarxiv icon

InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context

Add code
Jun 18, 2024
Viaarxiv icon