Picture for Zheng Lin

Zheng Lin

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Add code
Apr 22, 2025
Viaarxiv icon

Dynamic Early Exit in Reasoning Models

Add code
Apr 22, 2025
Viaarxiv icon

SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework

Add code
Apr 18, 2025
Viaarxiv icon

NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables

Add code
Apr 09, 2025
Viaarxiv icon

A Diffusion-Based Framework for Occluded Object Movement

Add code
Apr 02, 2025
Viaarxiv icon

State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning

Add code
Mar 26, 2025
Viaarxiv icon

Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks

Add code
Mar 26, 2025
Viaarxiv icon

Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs?

Add code
Mar 07, 2025
Viaarxiv icon

BeamLoRA: Beam-Constraint Low-Rank Adaptation

Add code
Feb 19, 2025
Viaarxiv icon

Towards Aligned Data Forgetting via Twin Machine Unlearning

Add code
Jan 15, 2025
Viaarxiv icon