Simran Arora

Just read twice: closing the recall gap for recurrent language models

Jul 07, 2024
Via arXiv

Optimistic Verifiable Training by Controlling Hardware Nondeterminism

Mar 16, 2024
Via arXiv

Simple linear attention language models balance the recall-throughput tradeoff

Feb 28, 2024
Via arXiv

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Feb 14, 2024
Via arXiv

Zoology: Measuring and Improving Recall in Efficient Language Models

Dec 08, 2023
Via arXiv

RELIC: Investigating Large Language Model Responses using Self-Consistency

Nov 28, 2023
Via arXiv

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Oct 18, 2023
Via arXiv

Resources and Evaluations for Multi-Distribution Dense Information Retrieval

Jun 21, 2023
Via arXiv

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Jun 20, 2023
Via arXiv

Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes

Apr 20, 2023
Via arXiv