Peijia Qin

LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling

May 13, 2026
Via arXiv

BioTool: A Comprehensive Tool-Calling Dataset for Enhancing Biomedical Capabilities of Large Language Models

May 07, 2026
Via arXiv

AIBuildAI: An AI Agent for Automatically Building AI Models

Apr 15, 2026
Via arXiv

DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation

Jan 29, 2026
Via arXiv

FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation

Jan 29, 2026
Via arXiv

Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

Jan 29, 2026
Via arXiv

DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding

Dec 17, 2025
Via arXiv

Neural Operators for Predictor Feedback Control of Nonlinear Delay Systems

Nov 28, 2024
Via arXiv

BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation

Oct 13, 2024
Via arXiv