Qifan Wang

Meta AI

Teacher-Guided Policy Optimization for LLM Distillation

May 13, 2026

Remember the Decision, Not the Description: A Rate-Distortion Framework for Agent Memory

May 11, 2026

Micro-Defects Expose Macro-Fakes: Detecting AI-Generated Images via Local Distributional Shifts

May 10, 2026

Objective Shaping with Hard Negatives: Windowed Partial AUC Optimization for RL-based LLM Recommenders

Apr 24, 2026

TrEEStealer: Stealing Decision Trees via Enclave Side Channels

Apr 20, 2026

Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization

Apr 14, 2026

CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

Apr 10, 2026

A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning

Mar 25, 2026

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Mar 15, 2026

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Mar 10, 2026