Picture for Fangzhen Lin

Fangzhen Lin

CogDoc: Towards Unified thinking in Documents

Add code
Dec 14, 2025
Viaarxiv icon

Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP

Add code
Oct 24, 2025
Figure 1 for Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Figure 2 for Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Figure 3 for Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Figure 4 for Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Viaarxiv icon

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Add code
Sep 03, 2025
Figure 1 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 2 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 3 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 4 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Viaarxiv icon

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Add code
May 23, 2025
Figure 1 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 2 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 3 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 4 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Viaarxiv icon

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Add code
May 21, 2025
Figure 1 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 2 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 3 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 4 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Viaarxiv icon

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Add code
Apr 10, 2025
Viaarxiv icon

SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

Add code
Mar 07, 2025
Figure 1 for SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
Figure 2 for SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
Viaarxiv icon

Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems

Add code
Feb 12, 2025
Figure 1 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 2 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 3 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 4 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Viaarxiv icon

The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study

Add code
Feb 11, 2025
Figure 1 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 2 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 3 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 4 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Viaarxiv icon

Adjustable Robust Reinforcement Learning for Online 3D Bin Packing

Add code
Oct 06, 2023
Viaarxiv icon