Picture for Junchi Yao

Junchi Yao

UGID: Unified Graph Isomorphism for Debiasing Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Functional Subspace Watermarking for Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

Add code
Mar 18, 2026
Viaarxiv icon

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Add code
Feb 10, 2026
Viaarxiv icon

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Add code
Dec 30, 2025
Viaarxiv icon

P1: Mastering Physics Olympiads with Reinforcement Learning

Add code
Nov 17, 2025
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images

Add code
Jun 08, 2025
Figure 1 for Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images
Figure 2 for Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images
Figure 3 for Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images
Figure 4 for Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images
Viaarxiv icon

Understanding the Repeat Curse in Large Language Models from a Feature Perspective

Add code
Apr 19, 2025
Viaarxiv icon

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Add code
Feb 18, 2025
Viaarxiv icon