Picture for Kyle Lo

Kyle Lo

Allen Institute for Artificial Intelligence

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Add code
Nov 11, 2024
Figure 1 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 2 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 3 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 4 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Viaarxiv icon

LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions

Add code
Oct 30, 2024
Figure 1 for LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions
Figure 2 for LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions
Figure 3 for LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions
Figure 4 for LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions
Viaarxiv icon

ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models

Add code
Oct 25, 2024
Figure 1 for ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
Figure 2 for ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
Figure 3 for ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
Figure 4 for ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
Viaarxiv icon

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Add code
Sep 25, 2024
Figure 1 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 2 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 3 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 4 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Viaarxiv icon

RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models

Add code
Sep 04, 2024
Viaarxiv icon

OLMoE: Open Mixture-of-Experts Language Models

Add code
Sep 03, 2024
Figure 1 for OLMoE: Open Mixture-of-Experts Language Models
Figure 2 for OLMoE: Open Mixture-of-Experts Language Models
Figure 3 for OLMoE: Open Mixture-of-Experts Language Models
Figure 4 for OLMoE: Open Mixture-of-Experts Language Models
Viaarxiv icon

Evaluating Language Model Math Reasoning via Grounding in Educational Curricula

Add code
Aug 08, 2024
Figure 1 for Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Figure 2 for Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Figure 3 for Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Figure 4 for Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Viaarxiv icon

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Add code
Jun 26, 2024
Figure 1 for The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources
Figure 2 for The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources
Figure 3 for The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources
Viaarxiv icon

One Thousand and One Pairs: A "novel" challenge for long-context language models

Add code
Jun 24, 2024
Viaarxiv icon

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon