Abstract: An overarching objective in contemporary statistical network analysis is extracting salient information from datasets consisting of multiple networks. To date, considerable attention has been devoted to node and network clustering, while comparatively little has been given to downstream connectivity estimation and parsimonious embedding dimension selection. Given a sample of potentially heterogeneous networks, this paper proposes a method to simultaneously estimate a latent matrix of connectivity probabilities and its embedding dimensionality, or rank, after first pre-estimating the number of communities and the node community memberships. The method is formulated as a convex optimization problem and solved using an alternating direction method of multipliers (ADMM) algorithm. We establish estimation error bounds under the Frobenius norm and the nuclear norm for settings in which the observable networks have blockmodel structure, even when node memberships are imperfectly recovered. When perfect membership recovery is possible and the dimensionality is much smaller than the number of communities, the proposed method outperforms conventional averaging-based methods for estimating connectivity and dimensionality. Numerical studies empirically demonstrate the accuracy of our method across various scenarios. Additionally, analysis of a primate brain dataset shows that the posited connectivity is not necessarily full rank in practice, illustrating the need for flexible methodology.
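The abstract leaves the optimization details implicit, but the core computational step in ADMM algorithms for nuclear-norm-penalized matrix estimation is the proximal operator of the nuclear norm, i.e., singular-value soft-thresholding. Below is a minimal sketch of such an iteration applied to a pre-estimated block connectivity matrix; the least-squares data term, the function names, and the default parameters are illustrative assumptions, not the authors' formulation.

```python
import numpy as np

def svt(M, tau):
    """Singular-value soft-thresholding: prox of tau * (nuclear norm)."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt

def admm_lowrank(B_hat, lam, rho=1.0, n_iter=200):
    """Sketch: estimate a low-rank connectivity matrix by solving
        min_P 0.5 * ||P - B_hat||_F^2 + lam * ||P||_*
    via ADMM on the split P = Z.

    B_hat : a block-level connectivity estimate (e.g., averaged edge
            densities between pre-estimated communities); illustrative
            stand-in for the paper's data-fidelity term.
    """
    P = np.zeros_like(B_hat)   # primal variable (data-fit block)
    Z = np.zeros_like(B_hat)   # auxiliary variable (low-rank block)
    U = np.zeros_like(B_hat)   # scaled dual variable
    for _ in range(n_iter):
        P = (B_hat + rho * (Z - U)) / (1.0 + rho)  # quadratic subproblem
        Z = svt(P + U, lam / rho)                  # nuclear-norm prox
        U = U + P - Z                              # dual update
    # The selected rank can be read off as the number of nonzero
    # singular values of the returned estimate.
    return Z
```

In this formulation the penalty weight lam controls the rank of the solution, so the embedding dimensionality is estimated jointly with the connectivity matrix rather than fixed in advance.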
Abstract: Despite significant advancements, Large Language Models (LLMs) exhibit blind spots that impair their ability to retrieve and process relevant contextual data effectively. We demonstrate that LLM performance on graph tasks whose complexity goes beyond the "needle-in-a-haystack" scenario, where solving the problem requires cross-referencing and reasoning across multiple subproblems jointly, is influenced by the proximity of relevant information within the context, a phenomenon we term "lost-in-distance". We examine two fundamental graph tasks, identifying common connections between two nodes and assessing similarity among three nodes, and show that the model's performance on these tasks depends significantly on the relative positioning of common edges. We evaluate three publicly available LLMs (Llama-3-8B, Llama-3-70B, and GPT-4) using various graph encoding techniques that represent graph structures for LLM input. We propose a formulation for the lost-in-distance phenomenon and demonstrate that the lost-in-distance and lost-in-the-middle phenomena occur independently. Results indicate that model accuracy can decline by up to 6x as the distance between node connections increases, independent of graph encoding and model size.
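The abstract does not specify the prompt construction. As a hedged sketch, one way to probe lost-in-distance on the common-connections task is to control how many filler edges separate the two edges that reveal a common neighbor within an edge-list graph encoding; the encoding format, filler strategy, and question wording below are illustrative assumptions, not the paper's protocol.

```python
import random

def encode_graph(edges):
    """A simple edge-list encoding of a graph for an LLM prompt."""
    return "\n".join(f"Node {u} is connected to node {v}." for u, v in edges)

def build_prompt(edges, u, v, gap, seed=0):
    """Place the two edges revealing a common neighbor of u and v exactly
    `gap` filler edges apart, so task accuracy can be measured as a
    function of the distance between the relevant facts in context."""
    random.seed(seed)
    common = max(max(e) for e in edges) + 1  # fresh node id for the common neighbor
    filler = list(edges)
    random.shuffle(filler)
    pos = min(gap, len(filler))
    context = [(u, common)] + filler[:pos] + [(v, common)] + filler[pos:]
    question = (f"\nWhich nodes are connected to both node {u} "
                f"and node {v}? List them.")
    return encode_graph(context) + question
```

Sweeping `gap` while holding the graph, encoding, and question fixed isolates the distance between the two relevant edges as the experimental variable, which is the kind of controlled comparison the abstract's up-to-6x accuracy decline presupposes.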