Picture for Hongwu Peng

Hongwu Peng

Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety

Add code
Jan 12, 2026
Viaarxiv icon

Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training

Add code
Dec 16, 2025
Viaarxiv icon

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS

Add code
Aug 19, 2025
Viaarxiv icon

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning

Add code
Apr 14, 2025
Viaarxiv icon

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models

Add code
Feb 04, 2025
Figure 1 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 2 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 3 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 4 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Viaarxiv icon

APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking

Add code
Jun 20, 2024
Viaarxiv icon

SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud

Add code
Jun 04, 2024
Figure 1 for SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud
Figure 2 for SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud
Figure 3 for SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud
Figure 4 for SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud
Viaarxiv icon

Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate

Add code
Feb 05, 2024
Figure 1 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 2 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 3 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Figure 4 for Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Viaarxiv icon

Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM

Add code
Jan 22, 2024
Figure 1 for Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM
Figure 2 for Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM
Figure 3 for Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM
Figure 4 for Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM
Viaarxiv icon

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Add code
Jan 19, 2024
Figure 1 for Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Figure 2 for Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Figure 3 for Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Figure 4 for Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Viaarxiv icon