Picture for Carl Yang

Carl Yang

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Add code
Oct 27, 2025
Figure 1 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 2 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 3 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 4 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Viaarxiv icon

Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals

Add code
Oct 16, 2025
Figure 1 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 2 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 3 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 4 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Viaarxiv icon

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Add code
Oct 09, 2025
Figure 1 for OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Figure 2 for OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Figure 3 for OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Figure 4 for OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Viaarxiv icon

RAG in the Wild: On the (In)effectiveness of LLMs with Mixture-of-Knowledge Retrieval Augmentation

Add code
Jul 26, 2025
Viaarxiv icon

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs

Add code
Jul 03, 2025
Figure 1 for KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs
Figure 2 for KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs
Figure 3 for KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs
Figure 4 for KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs
Viaarxiv icon

Towards a general-purpose foundation model for fMRI analysis

Add code
Jun 11, 2025
Figure 1 for Towards a general-purpose foundation model for fMRI analysis
Figure 2 for Towards a general-purpose foundation model for fMRI analysis
Figure 3 for Towards a general-purpose foundation model for fMRI analysis
Figure 4 for Towards a general-purpose foundation model for fMRI analysis
Viaarxiv icon

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Add code
Jun 04, 2025
Figure 1 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 2 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 3 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 4 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Viaarxiv icon

How to Use Graph Data in the Wild to Help Graph Anomaly Detection?

Add code
Jun 04, 2025
Viaarxiv icon

CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning

Add code
May 25, 2025
Viaarxiv icon

Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes

Add code
Apr 22, 2025
Figure 1 for Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes
Viaarxiv icon