Chenxiao Zhao

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Mar 04, 2026
Via arXiv

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Feb 15, 2026
Via arXiv

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Feb 11, 2026
Via arXiv

DeepEyesV2: Toward Agentic Multimodal Model

Nov 10, 2025
Via arXiv

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

May 20, 2025
Via arXiv

The Adversarial Attack and Detection under the Fisher Information Metric

Oct 09, 2018
Via arXiv