Picture for Kun Kuang

Kun Kuang

Sequential Treatment Effect Estimation with Unmeasured Confounders

Add code
May 14, 2025
Viaarxiv icon

Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement

Add code
Apr 12, 2025
Figure 1 for Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement
Figure 2 for Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement
Figure 3 for Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement
Figure 4 for Towards Stepwise Domain Knowledge-Driven Reasoning Optimization and Reflection Improvement
Viaarxiv icon

Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond

Add code
Mar 20, 2025
Figure 1 for Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Figure 2 for Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Figure 3 for Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Figure 4 for Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Viaarxiv icon

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Add code
Mar 14, 2025
Viaarxiv icon

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Add code
Feb 13, 2025
Figure 1 for Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
Figure 2 for Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
Figure 3 for Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
Figure 4 for Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
Viaarxiv icon

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Add code
Jan 25, 2025
Figure 1 for Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning
Figure 2 for Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning
Figure 3 for Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning
Figure 4 for Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration

Add code
Jan 10, 2025
Figure 1 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 2 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 3 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 4 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Viaarxiv icon

Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation

Add code
Jan 10, 2025
Viaarxiv icon

Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation

Add code
Jan 06, 2025
Figure 1 for Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Figure 2 for Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Figure 3 for Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Figure 4 for Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Viaarxiv icon