Chi-Heng Lin

Transform-Augmented GRPO Improves Pass@k

Jan 30, 2026
Via arXiv

VOYAGER: A Training Free Approach for Generating Diverse Datasets using LLMs

Dec 12, 2025
Via arXiv

MossNet: Mixture of State-Space Experts is a Multi-Head Attention

Oct 30, 2025
Via arXiv

Your contrastive learning problem is secretly a distribution alignment problem

Feb 27, 2025
Via arXiv

ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning

Jan 25, 2025
Via arXiv

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing

Jan 24, 2025
Via arXiv

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

Oct 15, 2024
Via arXiv

MoDeGPT: Modular Decomposition for Large Language Model Compression

Aug 20, 2024
Via arXiv

DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling

May 01, 2024
Via arXiv

Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

Feb 18, 2024
Via arXiv