Marek Cygan

NoMagic.AI, Institute of Informatics, University of Warsaw

Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners

May 29, 2025

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient

Feb 07, 2025

RoboMorph: Evolving Robot Morphology using Large Language Models

Jul 11, 2024

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

May 25, 2024

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Mar 01, 2024

A Case for Validation Buffer in Pessimistic Actor-Critic

Mar 01, 2024

Scaling Laws for Fine-Grained Mixture of Experts

Feb 12, 2024

Decoupled Actor-Critic

Oct 30, 2023

Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation

Oct 24, 2023

Grasping Student: semi-supervised learning for robotic manipulation

Mar 08, 2023