Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rafael C. Pinto

Neuroevolution of Self-Attention Over Proto-Objects

Apr 30, 2025

Rafael C. Pinto, Anderson R. Tavares

Abstract:Proto-objects - image regions that share common visual properties - offer a promising alternative to traditional attention mechanisms based on rectangular-shaped image patches in neural networks. Although previous work demonstrated that evolving a patch-based hard-attention module alongside a controller network could achieve state-of-the-art performance in visual reinforcement learning tasks, our approach leverages image segmentation to work with higher-level features. By operating on proto-objects rather than fixed patches, we significantly reduce the representational complexity: each image decomposes into fewer proto-objects than regular patches, and each proto-object can be efficiently encoded as a compact feature vector. This enables a substantially smaller self-attention module that processes richer semantic information. Our experiments demonstrate that this proto-object-based approach matches or exceeds the state-of-the-art performance of patch-based implementations with 62% less parameters and 2.6 times less training time.

* 9 pages, 16 figures, GECCO

Via

Access Paper or Ask Questions

PReLU: Yet Another Single-Layer Solution to the XOR Problem

Sep 17, 2024

Rafael C. Pinto, Anderson R. Tavares

Figure 1 for PReLU: Yet Another Single-Layer Solution to the XOR Problem

Figure 2 for PReLU: Yet Another Single-Layer Solution to the XOR Problem

Figure 3 for PReLU: Yet Another Single-Layer Solution to the XOR Problem

Figure 4 for PReLU: Yet Another Single-Layer Solution to the XOR Problem

Abstract:This paper demonstrates that a single-layer neural network using Parametric Rectified Linear Unit (PReLU) activation can solve the XOR problem, a simple fact that has been overlooked so far. We compare this solution to the multi-layer perceptron (MLP) and the Growing Cosine Unit (GCU) activation function and explain why PReLU enables this capability. Our results show that the single-layer PReLU network can achieve 100\% success rate in a wider range of learning rates while using only three learnable parameters.

Via

Access Paper or Ask Questions