Picture for Peng Zhou

Peng Zhou

UniBiDex: A Unified Teleoperation Framework for Robotic Bimanual Dexterous Manipulation

Add code
Jan 08, 2026
Viaarxiv icon

Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs

Add code
Dec 18, 2025
Figure 1 for Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Figure 2 for Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Figure 3 for Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Figure 4 for Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Viaarxiv icon

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Add code
Dec 16, 2025
Figure 1 for HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Figure 2 for HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Figure 3 for HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Figure 4 for HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Viaarxiv icon

Kunlun Anomaly Troubleshooter: Enabling Kernel-Level Anomaly Detection and Causal Reasoning for Large Model Distributed Inference

Add code
Nov 08, 2025
Viaarxiv icon

BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging

Add code
Sep 11, 2025
Figure 1 for BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging
Figure 2 for BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging
Figure 3 for BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging
Figure 4 for BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging
Viaarxiv icon

SpikingBrain Technical Report: Spiking Brain-inspired Large Models

Add code
Sep 05, 2025
Viaarxiv icon

Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models

Add code
Aug 24, 2025
Figure 1 for Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models
Figure 2 for Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models
Figure 3 for Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models
Figure 4 for Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models
Viaarxiv icon

Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation

Add code
Mar 11, 2025
Viaarxiv icon

M-LLM Based Video Frame Selection for Efficient Video Understanding

Add code
Feb 27, 2025
Viaarxiv icon

A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization

Add code
Feb 18, 2025
Figure 1 for A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization
Figure 2 for A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization
Figure 3 for A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization
Figure 4 for A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization
Viaarxiv icon