Defu Lian

Model Specific Task Similarity for Vision Language Model Selection via Layer Conductance

Feb 01, 2026

Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective

Jan 30, 2026

Is Softmax Loss All You Need? A Principled Analysis of Softmax-family Loss

Jan 30, 2026

Scaling Reasoning Hop Exposes Weaknesses: Demystifying and Improving Hop Generalization in Large Language Models

Jan 29, 2026

HumanLLM: Towards Personalized Understanding and Simulation of Human Nature

Jan 22, 2026

Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective

Jan 21, 2026

OpenOneRec Technical Report

Dec 31, 2025

From Feature Interaction to Feature Generation: A Generative Paradigm of CTR Prediction Models

Dec 16, 2025

LLM Cache Bandit Revisited: Addressing Query Heterogeneity for Cost-Effective LLM Inference

Sep 19, 2025

OmniGen2: Exploration to Advanced Multimodal Generation

Jun 23, 2025