Ge Li

Karlsruhe Institute of Technology

EvoCoT: Overcoming the Exploration Bottleneck in Reinforcement Learning

Aug 11, 2025
Via arXiv

RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization

Jul 31, 2025
Via arXiv

A Survey on Code Generation with LLM-based Agents

Jul 31, 2025
Via arXiv

DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt

Jun 11, 2025
Via arXiv

BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning

Jun 06, 2025
Via arXiv

SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

May 22, 2025
Via arXiv

CoT-Vid: Dynamic Chain-of-Thought Routing with Self Verification for Training-Free Video Reasoning

May 17, 2025
Via arXiv

Rethinking Repetition Problems of LLMs in Code Generation

May 15, 2025
Via arXiv

Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network

Apr 10, 2025
Via arXiv

Hierarchical Attention Networks for Lossless Point Cloud Attribute Compression

Apr 01, 2025
Via arXiv