Picture for Fred Zhang

Fred Zhang

Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics

Add code
May 13, 2026
Viaarxiv icon

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

Add code
Sep 30, 2024
Figure 1 for ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Figure 2 for ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Figure 3 for ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Figure 4 for ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Viaarxiv icon

Approaching Human-Level Forecasting with Language Models

Add code
Feb 28, 2024
Figure 1 for Approaching Human-Level Forecasting with Language Models
Figure 2 for Approaching Human-Level Forecasting with Language Models
Figure 3 for Approaching Human-Level Forecasting with Language Models
Figure 4 for Approaching Human-Level Forecasting with Language Models
Viaarxiv icon

Adaptive Regret for Bandits Made Possible: Two Queries Suffice

Add code
Jan 17, 2024
Figure 1 for Adaptive Regret for Bandits Made Possible: Two Queries Suffice
Figure 2 for Adaptive Regret for Bandits Made Possible: Two Queries Suffice
Figure 3 for Adaptive Regret for Bandits Made Possible: Two Queries Suffice
Figure 4 for Adaptive Regret for Bandits Made Possible: Two Queries Suffice
Viaarxiv icon

Constant Approximation for Individual Preference Stable Clustering

Add code
Sep 28, 2023
Figure 1 for Constant Approximation for Individual Preference Stable Clustering
Figure 2 for Constant Approximation for Individual Preference Stable Clustering
Figure 3 for Constant Approximation for Individual Preference Stable Clustering
Figure 4 for Constant Approximation for Individual Preference Stable Clustering
Viaarxiv icon

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Add code
Sep 27, 2023
Figure 1 for Towards Best Practices of Activation Patching in Language Models: Metrics and Methods
Figure 2 for Towards Best Practices of Activation Patching in Language Models: Metrics and Methods
Figure 3 for Towards Best Practices of Activation Patching in Language Models: Metrics and Methods
Figure 4 for Towards Best Practices of Activation Patching in Language Models: Metrics and Methods
Viaarxiv icon

Streaming Algorithms for Learning with Experts: Deterministic Versus Robust

Add code
Mar 03, 2023
Viaarxiv icon

Privately Estimating a Gaussian: Efficient, Robust and Optimal

Add code
Dec 15, 2022
Figure 1 for Privately Estimating a Gaussian: Efficient, Robust and Optimal
Figure 2 for Privately Estimating a Gaussian: Efficient, Robust and Optimal
Viaarxiv icon

Optimal Query Complexities for Dynamic Trace Estimation

Add code
Sep 30, 2022
Figure 1 for Optimal Query Complexities for Dynamic Trace Estimation
Figure 2 for Optimal Query Complexities for Dynamic Trace Estimation
Figure 3 for Optimal Query Complexities for Dynamic Trace Estimation
Figure 4 for Optimal Query Complexities for Dynamic Trace Estimation
Viaarxiv icon

Online Prediction in Sub-linear Space

Add code
Jul 16, 2022
Figure 1 for Online Prediction in Sub-linear Space
Viaarxiv icon