Alert button
Picture for Kyle Richardson

Kyle Richardson

Alert button

TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

Add code
Bookmark button
Alert button
Feb 08, 2024
Yikai Zhang, Siyu Yuan, Caiyu Hu, Kyle Richardson, Yanghua Xiao, Jiangjie Chen

Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Bookmark button
Alert button
Feb 07, 2024
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi

Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Bookmark button
Alert button
Jan 31, 2024
Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo

Viaarxiv icon

Paloma: A Benchmark for Evaluating Language Model Fit

Add code
Bookmark button
Alert button
Dec 16, 2023
Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge

Viaarxiv icon

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Add code
Bookmark button
Alert button
Dec 15, 2023
Dirk Groeneveld, Anas Awadalla, Iz Beltagy, Akshita Bhagia, Ian Magnusson, Hao Peng, Oyvind Tafjord, Pete Walsh, Kyle Richardson, Jesse Dodge

Viaarxiv icon

Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena

Add code
Bookmark button
Alert button
Oct 09, 2023
Jiangjie Chen, Siyu Yuan, Rong Ye, Bodhisattwa Prasad Majumder, Kyle Richardson

Figure 1 for Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Figure 2 for Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Figure 3 for Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Figure 4 for Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Viaarxiv icon

Language Models with Rationality

Add code
Bookmark button
Alert button
May 23, 2023
Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schutze, Peter Clark

Figure 1 for Language Models with Rationality
Figure 2 for Language Models with Rationality
Figure 3 for Language Models with Rationality
Figure 4 for Language Models with Rationality
Viaarxiv icon

DISCO: Distilling Phrasal Counterfactuals with Large Language Models

Add code
Bookmark button
Alert button
Dec 20, 2022
Zeming Chen, Qiyue Gao, Kyle Richardson, Antoine Bosselut, Ashish Sabharwal

Figure 1 for DISCO: Distilling Phrasal Counterfactuals with Large Language Models
Figure 2 for DISCO: Distilling Phrasal Counterfactuals with Large Language Models
Figure 3 for DISCO: Distilling Phrasal Counterfactuals with Large Language Models
Figure 4 for DISCO: Distilling Phrasal Counterfactuals with Large Language Models
Viaarxiv icon

Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

Add code
Bookmark button
Alert button
Nov 15, 2022
Kyle Richardson, Ronen Tamari, Oren Sultan, Reut Tsarfaty, Dafna Shahaf, Ashish Sabharwal

Figure 1 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 2 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 3 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 4 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Viaarxiv icon