Ravid Shwartz-Ziv

UAT-LITE: Inference-Time Uncertainty-Aware Attention for Pretrained Transformers

Feb 03, 2026
Via arXiv

Beyond the Loss Curve: Scaling Laws, Active Learning, and the Limits of Learning from Exact Posteriors

Jan 30, 2026
Via arXiv

Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact

Jul 01, 2025
Via arXiv

From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning

May 21, 2025
Via arXiv

Layer by Layer: Uncovering Hidden Representations in Language Models

Feb 04, 2025
Via arXiv

Video Representation Learning with Joint-Embedding Predictive Architectures

Dec 14, 2024
Via arXiv

Does Representation Matter? Exploring Intermediate Layers in Large Language Models

Dec 12, 2024
Via arXiv

Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation

Dec 10, 2024
Via arXiv

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Nov 04, 2024
Via arXiv

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Oct 10, 2024
Via arXiv