Picture for Qifan Wang

Qifan Wang

Meta AI

A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning

Add code
Mar 25, 2026
Viaarxiv icon

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Add code
Mar 15, 2026
Viaarxiv icon

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Add code
Mar 10, 2026
Viaarxiv icon

DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding

Add code
Mar 09, 2026
Viaarxiv icon

Verifiable Reasoning for LLM-based Generative Recommendation

Add code
Mar 08, 2026
Viaarxiv icon

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

Add code
Feb 23, 2026
Viaarxiv icon

Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

Add code
Feb 19, 2026
Viaarxiv icon

Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System

Add code
Feb 18, 2026
Viaarxiv icon

Bringing Reasoning to Generative Recommendation Through the Lens of Cascaded Ranking

Add code
Feb 03, 2026
Viaarxiv icon

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Add code
Feb 02, 2026
Viaarxiv icon