Picture for Zheng Li

Zheng Li

Department of Computer Science, Cornell Tech

Sequential Policy Gradient for Adaptive Hyperparameter Optimization

Add code
Jun 18, 2025
Viaarxiv icon

HauntAttack: When Attack Follows Reasoning as a Shadow

Add code
Jun 08, 2025
Viaarxiv icon

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Add code
Jun 04, 2025
Viaarxiv icon

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Add code
May 27, 2025
Viaarxiv icon

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Add code
May 22, 2025
Viaarxiv icon

FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

Add code
May 21, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

Add code
May 16, 2025
Viaarxiv icon

FloE: On-the-Fly MoE Inference on Memory-constrained GPU

Add code
May 12, 2025
Viaarxiv icon

FloE: On-the-Fly MoE Inference

Add code
May 09, 2025
Viaarxiv icon