Picture for Yuxin Wang

Yuxin Wang

Stream-T1: Test-Time Scaling for Streaming Video Generation

Add code
May 06, 2026
Viaarxiv icon

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Add code
May 05, 2026
Viaarxiv icon

Beyond Rating: A Comprehensive Evaluation and Benchmark for AI Reviews

Add code
Apr 22, 2026
Viaarxiv icon

AU Codes, Language, and Synthesis: Translating Anatomy to Text for Facial Behavior Synthesis

Add code
Mar 19, 2026
Viaarxiv icon

Generalized Bayes for Causal Inference

Add code
Mar 03, 2026
Viaarxiv icon

Targeted Synthetic Control Method

Add code
Feb 04, 2026
Viaarxiv icon

On the Spectral Flattening of Quantized Embeddings

Add code
Feb 01, 2026
Viaarxiv icon

AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts

Add code
Jan 29, 2026
Viaarxiv icon

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Add code
Jan 17, 2026
Viaarxiv icon

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Add code
Jan 16, 2026
Viaarxiv icon