David Zhang

A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

Mar 25, 2026

Prompt Tuning for CLIP on the Pretrained Manifold

Feb 22, 2026

Dynamic Prior Thompson Sampling for Cold-Start Exploration in Recommender Systems

Feb 01, 2026

A Cosine Network for Image Super-Resolution

Jan 23, 2026

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

Toward Training Superintelligent Software Agents through Self-Play SWE-RL

Dec 21, 2025

Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

Nov 14, 2025

Domain Gating Ensemble Networks for AI-Generated Text Detection

May 20, 2025

Compact Recurrent Transformer with Persistent Memory

May 02, 2025

CaLMFlow: Volterra Flow Matching using Causal Language Models

Oct 03, 2024