Picture for Sepp Hochreiter

Sepp Hochreiter

On Subquadratic Architectures: From Applications to Principles

Add code
Jun 10, 2026
Viaarxiv icon

RREDCoT: Segment-Level Reward Redistribution for Reasoning Models

Add code
Jun 04, 2026
Viaarxiv icon

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Add code
May 28, 2026
Viaarxiv icon

Effective Distillation to Hybrid xLSTM Architectures

Add code
Mar 16, 2026
Viaarxiv icon

The Offline-Frontier Shift: Diagnosing Distributional Limits in Generative Multi-Objective Optimization

Add code
Feb 11, 2026
Viaarxiv icon

Adaptive Retrieval helps Reasoning in LLMs -- but mostly if it's not used

Add code
Feb 06, 2026
Viaarxiv icon

AP-OOD: Attention Pooling for Out-of-Distribution Detection

Add code
Feb 05, 2026
Viaarxiv icon

Pre-trained Forecasting Models: Strong Zero-Shot Feature Extractors for Time Series Classification

Add code
Oct 30, 2025
Figure 1 for Pre-trained Forecasting Models: Strong Zero-Shot Feature Extractors for Time Series Classification
Figure 2 for Pre-trained Forecasting Models: Strong Zero-Shot Feature Extractors for Time Series Classification
Figure 3 for Pre-trained Forecasting Models: Strong Zero-Shot Feature Extractors for Time Series Classification
Figure 4 for Pre-trained Forecasting Models: Strong Zero-Shot Feature Extractors for Time Series Classification
Viaarxiv icon

Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation

Add code
Oct 02, 2025
Figure 1 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 2 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 3 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 4 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Viaarxiv icon

xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity

Add code
Oct 02, 2025
Viaarxiv icon