Piotr Miłoś

When Does Non-Uniform Replay Matter in Reinforcement Learning?

May 11, 2026

Voxtral TTS

Mar 26, 2026

Voxtral Realtime

Feb 11, 2026

Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

May 03, 2025

Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

Apr 23, 2025

Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery

Feb 22, 2025

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient

Feb 07, 2025

Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe

Jun 06, 2024

What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

Jun 05, 2024

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

May 25, 2024