Picture for Joshua Maynez

Joshua Maynez

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Add code
Jun 06, 2024
Viaarxiv icon

Bayesian Prediction-Powered Inference

Add code
May 09, 2024
Figure 1 for Bayesian Prediction-Powered Inference
Figure 2 for Bayesian Prediction-Powered Inference
Figure 3 for Bayesian Prediction-Powered Inference
Figure 4 for Bayesian Prediction-Powered Inference
Viaarxiv icon

Learning to Plan and Generate Text with Citations

Add code
Apr 04, 2024
Figure 1 for Learning to Plan and Generate Text with Citations
Figure 2 for Learning to Plan and Generate Text with Citations
Figure 3 for Learning to Plan and Generate Text with Citations
Figure 4 for Learning to Plan and Generate Text with Citations
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization

Add code
Nov 15, 2023
Viaarxiv icon

Calibrating Likelihoods towards Consistency in Summarization Models

Add code
Oct 12, 2023
Figure 1 for Calibrating Likelihoods towards Consistency in Summarization Models
Figure 2 for Calibrating Likelihoods towards Consistency in Summarization Models
Figure 3 for Calibrating Likelihoods towards Consistency in Summarization Models
Figure 4 for Calibrating Likelihoods towards Consistency in Summarization Models
Viaarxiv icon

Benchmarking Large Language Model Capabilities for Conditional Generation

Add code
Jun 29, 2023
Figure 1 for Benchmarking Large Language Model Capabilities for Conditional Generation
Figure 2 for Benchmarking Large Language Model Capabilities for Conditional Generation
Figure 3 for Benchmarking Large Language Model Capabilities for Conditional Generation
Figure 4 for Benchmarking Large Language Model Capabilities for Conditional Generation
Viaarxiv icon

$μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge

Add code
May 23, 2023
Figure 1 for $μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge
Figure 2 for $μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge
Figure 3 for $μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge
Figure 4 for $μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge
Viaarxiv icon

SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation

Add code
May 22, 2023
Figure 1 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 2 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 3 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 4 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon