Jerry Huang

How Well Can a Long Sequence Model Model Long Sequences? Comparing Architechtural Inductive Biases on Long-Context Abilities

Jul 11, 2024
Via arXiv

Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

May 24, 2024

Towards Practical Tool Usage for Continually Learning LLMs

Apr 14, 2024

EpiK-Eval: Evaluation for Language Models as Epistemic Models

Oct 23, 2023

Online Algorithms with Uncertainty-Quantified Predictions

Oct 17, 2023

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

Jul 18, 2023

Trust-ya: design of a multiplayer game for the study of small group processes

Sep 09, 2021