Picture for Wei Wei

Wei Wei

Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces

Add code
Jun 06, 2025
Viaarxiv icon

ProRefine: Inference-time Prompt Refinement with Textual Feedback

Add code
Jun 05, 2025
Viaarxiv icon

Risk-aware Direct Preference Optimization under Nested Risk Measure

Add code
May 29, 2025
Viaarxiv icon

SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards

Add code
May 25, 2025
Viaarxiv icon

Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning

Add code
May 25, 2025
Viaarxiv icon

Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control

Add code
May 23, 2025
Viaarxiv icon

No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves

Add code
May 05, 2025
Viaarxiv icon

Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled yet Hard-to-Learn Samples in Noisy Data

Add code
Apr 24, 2025
Viaarxiv icon

AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG

Add code
Apr 21, 2025
Viaarxiv icon

Fourier Feature Attribution: A New Efficiency Attribution Method

Add code
Apr 02, 2025
Viaarxiv icon