Mingyi Hong

EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation

Jun 04, 2025
Via arXiv

Scalable Parameter and Memory Efficient Pretraining for LLM: Recent Algorithmic Advances and Benchmarking

May 28, 2025
Via arXiv

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems

May 23, 2025
Via arXiv

Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment

May 17, 2025
Via arXiv

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

May 16, 2025
Via arXiv

Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows

May 07, 2025
Via arXiv

Learning Explainable Dense Reward Shapes via Bayesian Optimization

Apr 22, 2025
Via arXiv

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

Mar 15, 2025
Via arXiv

Effectively Steer LLM To Follow Preference via Building Confident Directions

Mar 04, 2025
Via arXiv

LUME: LLM Unlearning with Multitask Evaluations

Feb 20, 2025
Via arXiv