Picture for Shwai He

Shwai He

Making Large Language Models Efficient Dense Retrievers

Add code
Dec 23, 2025
Viaarxiv icon

Dense Video Understanding with Gated Residual Tokenization

Add code
Sep 18, 2025
Viaarxiv icon

CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs

Add code
May 19, 2025
Viaarxiv icon

SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning

Add code
Apr 14, 2025
Figure 1 for SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Figure 2 for SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Figure 3 for SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Figure 4 for SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Viaarxiv icon

Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts

Add code
Mar 07, 2025
Viaarxiv icon

Fair Diagnosis: Leveraging Causal Modeling to Mitigate Medical Bias

Add code
Dec 06, 2024
Figure 1 for Fair Diagnosis: Leveraging Causal Modeling to Mitigate Medical Bias
Figure 2 for Fair Diagnosis: Leveraging Causal Modeling to Mitigate Medical Bias
Figure 3 for Fair Diagnosis: Leveraging Causal Modeling to Mitigate Medical Bias
Figure 4 for Fair Diagnosis: Leveraging Causal Modeling to Mitigate Medical Bias
Viaarxiv icon

Towards counterfactual fairness thorough auxiliary variables

Add code
Dec 06, 2024
Viaarxiv icon

Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers

Add code
Oct 17, 2024
Viaarxiv icon

What Matters in Transformers? Not All Attention is Needed

Add code
Jun 22, 2024
Viaarxiv icon

Loki: Low-Rank Keys for Efficient Sparse Attention

Add code
Jun 04, 2024
Figure 1 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 2 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 3 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 4 for Loki: Low-Rank Keys for Efficient Sparse Attention
Viaarxiv icon