Shuyi Zhang

FORCE: Efficient VLA Reinforcement Fine-Tuning via Value-Calibrated Warm-up and Self-Distillation

Jun 24, 2026

DMT-CBT: Longitudinal Therapeutic State Modeling for CBT Counseling

Jun 02, 2026

OneVLA: A Unified Framework for Embodied Tasks

May 31, 2026

The AI Hippocampus: How Far are We From Human Memory?

Jan 14, 2026

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Dec 19, 2025

Interpretable Reward Model via Sparse Autoencoder

Aug 12, 2025

Marine Chlorophyll Prediction and Driver Analysis based on LSTM-RF Hybrid Models

Aug 07, 2025

A Context-Aware Dual-Metric Framework for Confidence Estimation in Large Language Models

Aug 01, 2025

RoboBrain 2.0 Technical Report

Jul 02, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

Jun 12, 2025