Picture for Kun Kuang

Kun Kuang

D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples

Add code
May 28, 2025
Viaarxiv icon

Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

Add code
May 26, 2025
Viaarxiv icon

LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

Add code
May 26, 2025
Viaarxiv icon

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

Add code
May 22, 2025
Viaarxiv icon

Sequential Treatment Effect Estimation with Unmeasured Confounders

Add code
May 14, 2025
Viaarxiv icon

Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement

Add code
Apr 12, 2025
Viaarxiv icon

Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond

Add code
Mar 20, 2025
Viaarxiv icon

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Add code
Mar 14, 2025
Viaarxiv icon

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Add code
Feb 13, 2025
Viaarxiv icon

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Add code
Jan 25, 2025
Viaarxiv icon