Systematic Generalization


Benchmarking Tabular Foundation Models for Conditional Density Estimation in Regression

Add code
Mar 27, 2026
Viaarxiv icon

HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models

Add code
Mar 27, 2026
Viaarxiv icon

How Open Must Language Models be to Enable Reliable Scientific Inference?

Add code
Mar 27, 2026
Viaarxiv icon

CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions

Add code
Mar 27, 2026
Viaarxiv icon

A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

Add code
Mar 26, 2026
Viaarxiv icon

Dynamic LIBRAS Gesture Recognition via CNN over Spatiotemporal Matrix Representation

Add code
Mar 26, 2026
Viaarxiv icon

Analysing Calls to Order in German Parliamentary Debates

Add code
Mar 27, 2026
Viaarxiv icon

Towards Generalizable Robotic Data Flywheel: High-Dimensional Factorization and Composition

Add code
Mar 26, 2026
Viaarxiv icon

The Limits of Learning from Pictures and Text: Vision-Language Models and Embodied Scene Understanding

Add code
Mar 27, 2026
Viaarxiv icon

TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

Add code
Mar 26, 2026
Viaarxiv icon