Picture for Qingyang Zhang

Qingyang Zhang

Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization

Add code
Apr 08, 2025
Figure 1 for Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization
Figure 2 for Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization
Figure 3 for Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization
Figure 4 for Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization
Viaarxiv icon

Educator Attention: How computational tools can systematically identify the distribution of a key resource for students

Add code
Feb 27, 2025
Viaarxiv icon

COME: Test-time adaption by Conservatively Minimizing Entropy

Add code
Oct 12, 2024
Figure 1 for COME: Test-time adaption by Conservatively Minimizing Entropy
Figure 2 for COME: Test-time adaption by Conservatively Minimizing Entropy
Figure 3 for COME: Test-time adaption by Conservatively Minimizing Entropy
Figure 4 for COME: Test-time adaption by Conservatively Minimizing Entropy
Viaarxiv icon

The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection

Add code
Oct 12, 2024
Viaarxiv icon

BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports

Add code
Aug 21, 2024
Figure 1 for BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Figure 2 for BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Figure 3 for BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Figure 4 for BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Viaarxiv icon

Multimodal Fusion on Low-quality Data: A Comprehensive Survey

Add code
Apr 27, 2024
Figure 1 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 2 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 3 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 4 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Viaarxiv icon

Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making

Add code
Dec 18, 2023
Viaarxiv icon

Step-by-Step Remediation of Students' Mathematical Mistakes

Add code
Oct 16, 2023
Figure 1 for Step-by-Step Remediation of Students' Mathematical Mistakes
Figure 2 for Step-by-Step Remediation of Students' Mathematical Mistakes
Figure 3 for Step-by-Step Remediation of Students' Mathematical Mistakes
Figure 4 for Step-by-Step Remediation of Students' Mathematical Mistakes
Viaarxiv icon

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs

Add code
Jul 22, 2023
Figure 1 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 2 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 3 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 4 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Viaarxiv icon

Provable Dynamic Fusion for Low-Quality Multimodal Data

Add code
Jun 06, 2023
Viaarxiv icon