Picture for Feng Zhang

Feng Zhang

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

Add code
May 13, 2026
Viaarxiv icon

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

Add code
May 11, 2026
Viaarxiv icon

Bridging Passive and Active: Enhancing Conversation Starter Recommendation via Active Expression Modeling

Add code
May 07, 2026
Viaarxiv icon

Earth-o1: A Grid-free Observation-native Atmospheric World Model

Add code
May 07, 2026
Viaarxiv icon

Toward Scalable Terminal Task Synthesis via Skill Graphs

Add code
Apr 28, 2026
Viaarxiv icon

IceBreaker for Conversational Agents: Breaking the First-Message Barrier with Personalized Starters

Add code
Apr 20, 2026
Viaarxiv icon

The Fourth Challenge on Image Super-Resolution ($\times$4) at NTIRE 2026: Benchmark Results and Method Overview

Add code
Apr 16, 2026
Viaarxiv icon

SEPTQ: A Simple and Effective Post-Training Quantization Paradigm for Large Language Models

Add code
Apr 11, 2026
Viaarxiv icon

WRAP++: Web discoveRy Amplified Pretraining

Add code
Apr 09, 2026
Viaarxiv icon

PolicyLong: Towards On-Policy Context Extension

Add code
Apr 09, 2026
Viaarxiv icon