Picture for Piotr Miłoś

Piotr Miłoś

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

Add code
May 03, 2025
Viaarxiv icon

Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

Add code
Apr 23, 2025
Viaarxiv icon

Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery

Add code
Feb 22, 2025
Viaarxiv icon

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient

Add code
Feb 07, 2025
Figure 1 for Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
Figure 2 for Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
Figure 3 for Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
Figure 4 for Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
Viaarxiv icon

Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe

Add code
Jun 06, 2024
Figure 1 for Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Figure 2 for Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Figure 3 for Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Figure 4 for Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Viaarxiv icon

What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

Add code
Jun 05, 2024
Viaarxiv icon

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

Add code
May 25, 2024
Figure 1 for Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Figure 2 for Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Figure 3 for Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Figure 4 for Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Viaarxiv icon

tsGT: Stochastic Time Series Modeling With Transformer

Add code
Mar 15, 2024
Viaarxiv icon

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Add code
Mar 01, 2024
Figure 1 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 2 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 3 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Figure 4 for Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Viaarxiv icon