Picture for Zhi Zheng

Zhi Zheng

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Viaarxiv icon

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

Add code
Jun 08, 2025
Viaarxiv icon

Rethinking Neural Combinatorial Optimization for Vehicle Routing Problems with Different Constraint Tightness Degrees

Add code
May 30, 2025
Viaarxiv icon

Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification

Add code
May 18, 2025
Viaarxiv icon

Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data

Add code
May 08, 2025
Viaarxiv icon

Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images

Add code
Mar 04, 2025
Viaarxiv icon

Reward-Safety Balance in Offline Safe RL via Diffusion Regularization

Add code
Feb 18, 2025
Viaarxiv icon

Unleashing the Power of Large Language Model for Denoising Recommendation

Add code
Feb 13, 2025
Viaarxiv icon

Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design

Add code
Jan 16, 2025
Viaarxiv icon

CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention

Add code
Nov 30, 2024
Viaarxiv icon