Picture for Boxi Yu

Boxi Yu

OpenRCA 2.0: From Outcome Labels to Causal Process Supervision

Add code
Jun 25, 2026
Viaarxiv icon

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

Add code
May 29, 2026
Viaarxiv icon

Retromorphic Testing with Hierarchical Verification for Hallucination Detection in RAG

Add code
Mar 29, 2026
Viaarxiv icon

CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series

Add code
Oct 26, 2025
Figure 1 for CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series
Figure 2 for CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series
Figure 3 for CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series
Figure 4 for CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series
Viaarxiv icon

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Add code
Jun 10, 2025
Viaarxiv icon

How Should I Build A Benchmark?

Add code
Jan 18, 2025
Viaarxiv icon

Retromorphic Testing: A New Approach to the Test Oracle Problem

Add code
Oct 10, 2023
Viaarxiv icon

Automated Testing and Improvement of Named Entity Recognition Systems

Add code
Aug 14, 2023
Figure 1 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 2 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 3 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 4 for Automated Testing and Improvement of Named Entity Recognition Systems
Viaarxiv icon