Chen Yueh-Han

Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors

Jun 12, 2025
Via arXiv

SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts

May 27, 2025
Via arXiv

Beyond Memorization: Mapping the Originality-Quality Frontier of Language Models

Apr 13, 2025
Via arXiv

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

Sep 30, 2024
Via arXiv

Approaching Human-Level Forecasting with Language Models

Feb 28, 2024
Via arXiv