Felix Yu

Jay

Machine Learning on Heterogeneous, Edge, and Quantum Hardware for Particle Physics (ML-HEQUPP)

Feb 24, 2026
Via arXiv

Spark Transformer: Reactivating Sparsity in FFN and Attention

Jun 07, 2025
Via arXiv

Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models

Apr 12, 2025
Via arXiv

LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization

Oct 27, 2024
Via arXiv

Baby Bear: Seeking a Just Right Rating Scale for Scalar Annotations

Aug 19, 2024
Via arXiv

Efficient Document Ranking with Learnable Late Interactions

Jun 25, 2024
Via arXiv

Large Language Models are Interpretable Learners

Jun 25, 2024
Via arXiv

Metric-aware LLM inference

Mar 07, 2024
Via arXiv

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Dec 15, 2023
Via arXiv

SpecTr: Fast Speculative Decoding via Optimal Transport

Oct 23, 2023
Via arXiv