Minda Hu

ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning

Jan 08, 2026

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Jul 28, 2025

WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback

May 26, 2025

A Survey of Personalized Large Language Models: Progress and Future Directions

Feb 17, 2025

NILE: Internal Consistency Alignment in Large Language Models

Dec 21, 2024

Purple-teaming LLMs with Adversarial Defender Training

Jul 01, 2024

Mitigating Large Language Model Hallucination with Faithful Finetuning

Jun 17, 2024

Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Jun 17, 2024

The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing

Apr 12, 2024

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Feb 29, 2024