Alert button
Picture for Andreas Östling

Andreas Östling

Alert button

The Cambridge Law Corpus: A Corpus for Legal AI Research

Sep 22, 2023
Andreas Östling, Holli Sargeant, Huiyuan Xie, Ludwig Bull, Alexander Terenin, Leif Jonsson, Måns Magnusson, Felix Steffek

We introduce the Cambridge Law Corpus (CLC), a corpus for legal AI research. It consists of over 250 000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. This paper presents the first release of the corpus, containing the raw text and meta-data. Together with the corpus, we provide annotations on case outcomes for 638 cases, done by legal experts. Using our annotated data, we have trained and evaluated case outcome extraction with GPT-3, GPT-4 and RoBERTa models to provide benchmarks. We include an extensive legal and ethical discussion to address the potentially sensitive nature of this material. As a consequence, the corpus will only be released for research purposes under certain restrictions.

* Advances in Neural Information Processing Systems, Datasets and Benchmarks Track, 2023  
Viaarxiv icon

Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants

Jun 12, 2020
Taro Langner, Andreas Östling, Lukas Maldonis, Albin Karlsson, Daniel Olmo, Dag Lindgren, Andreas Wallin, Lowe Lundin, Robin Strand, Håkan Ahlström, Joel Kullberg

Figure 1 for Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants
Figure 2 for Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants
Figure 3 for Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants
Figure 4 for Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants

The UK Biobank is collecting extensive data on health-related characteristics of over half a million volunteers. The biological samples of blood and urine can provide valuable insight on kidney function, with important links to cardiovascular and metabolic health. Further information on kidney anatomy could be obtained by medical imaging. In contrast to the brain, heart, liver, and pancreas, no dedicated Magnetic Resonance Imaging (MRI) is planned for the kidneys. An image-based assessment is nonetheless feasible in the neck-to-knee body MRI intended for abdominal body composition analysis, which also covers the kidneys. In this work, a pipeline for automated segmentation of parenchymal kidney volume in UK Biobank neck-to-knee body MRI is proposed. The underlying neural network reaches a relative error of 3.8%, with Dice score 0.956 in validation on 64 subjects, close to the 2.6% and Dice score 0.962 for repeated segmentation by one human operator. The released MRI of about 40,000 subjects can be processed within two days, yielding volume measurements of left and right kidney. Algorithmic quality ratings enabled the exclusion of outliers and potential failure cases. The resulting measurements can be studied and shared for large-scale investigation of associations and longitudinal changes in parenchymal kidney volume.

Viaarxiv icon