Picture for Kun Kuang

Kun Kuang

General Information Metrics for Improving AI Model Training Efficiency

Add code
Jan 02, 2025
Figure 1 for General Information Metrics for Improving AI Model Training Efficiency
Figure 2 for General Information Metrics for Improving AI Model Training Efficiency
Figure 3 for General Information Metrics for Improving AI Model Training Efficiency
Figure 4 for General Information Metrics for Improving AI Model Training Efficiency
Viaarxiv icon

FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning

Add code
Dec 25, 2024
Viaarxiv icon

Learning Causal Transition Matrix for Instance-dependent Label Noise

Add code
Dec 18, 2024
Figure 1 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 2 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 3 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 4 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Viaarxiv icon

Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator

Add code
Dec 12, 2024
Viaarxiv icon

R3HF: Reward Redistribution for Enhancing Reinforcement Learning from Human Feedback

Add code
Nov 13, 2024
Viaarxiv icon

Causality for Large Language Models

Add code
Oct 20, 2024
Figure 1 for Causality for Large Language Models
Figure 2 for Causality for Large Language Models
Figure 3 for Causality for Large Language Models
Figure 4 for Causality for Large Language Models
Viaarxiv icon

Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs

Add code
Oct 02, 2024
Figure 1 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 2 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 3 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 4 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Viaarxiv icon

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering

Add code
Sep 24, 2024
Figure 1 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 2 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 3 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 4 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Viaarxiv icon

RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU

Add code
Sep 09, 2024
Figure 1 for RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Figure 2 for RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Figure 3 for RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Figure 4 for RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Viaarxiv icon

IntOPE: Off-Policy Evaluation in the Presence of Interference

Add code
Aug 24, 2024
Figure 1 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 2 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 3 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 4 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Viaarxiv icon