Ivan Titov

Layerwise Recurrent Router for Mixture-of-Experts

Aug 13, 2024
Via arXiv

Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks

Aug 09, 2024
Via arXiv

Explanation Regularisation through the Lens of Attributions

Jul 23, 2024
Via arXiv

Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations

Jul 05, 2024
Via arXiv

Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection

May 03, 2024
Via arXiv

Unlearning Reveals the Influential Training Data of Language Models

Jan 26, 2024
Via arXiv

Compositional Generalization for Data-to-Text Generation

Dec 05, 2023
Via arXiv

Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study

Nov 16, 2023
Via arXiv

Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation

Nov 09, 2023
Via arXiv

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

Oct 25, 2023
Via arXiv