Luyao Niu

JobBench: Aligning Agent Work With Human Will

May 25, 2026

The WidthWall: A Strict Expressivity Hierarchy for Hypergraph Neural Networks

May 13, 2026

Polyhedral Instability Governs Regret in Online Learning

May 13, 2026

Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

May 12, 2026

ST-ProC: A Graph-Prototypical Framework for Robust Semi-Supervised Travel Mode Identification

Nov 17, 2025

Event-CausNet: Unlocking Causal Knowledge from Text with Large Language Models for Reliable Spatio-Temporal Forecasting

Nov 16, 2025

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

May 29, 2025

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

May 27, 2025

Temporal Sampling for Forgotten Reasoning in LLMs

May 26, 2025

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

May 20, 2025