Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mahsa Shamsabadi

A FAIR and Free Prompt-based Research Assistant

May 23, 2024

Mahsa Shamsabadi, Jennifer D'Souza

Abstract:This demo will present the Research Assistant (RA) tool developed to assist with six main types of research tasks defined as standardized instruction templates, instantiated with user input, applied finally as prompts to well-known--for their sophisticated natural language processing abilities--AI tools, such as ChatGPT (https://chat.openai.com/) and Gemini (https://gemini.google.com/app). The six research tasks addressed by RA are: creating FAIR research comparisons, ideating research topics, drafting grant applications, writing scientific blogs, aiding preliminary peer reviews, and formulating enhanced literature search queries. RA's reliance on generative AI tools like ChatGPT or Gemini means the same research task assistance can be offered in any scientific discipline. We demonstrate its versatility by sharing RA outputs in Computer Science, Virology, and Climate Science, where the output with the RA tool assistance mirrored that from a domain expert who performed the same research task.

* 6 pages, 2 figures, accepted to the Demo track of NLDB 2024 (https://nldb2024.di.unito.it/)

Via

Access Paper or Ask Questions

From Keywords to Structured Summaries: Streamlining Scholarly Knowledge Access

Feb 22, 2024

Mahsa Shamsabadi, Jennifer D'Souza

Abstract:This short paper highlights the growing importance of information retrieval (IR) engines in the scientific community, addressing the inefficiency of traditional keyword-based search engines due to the rising volume of publications. The proposed solution involves structured records, underpinning advanced information technology (IT) tools, including visualization dashboards, to revolutionize how researchers access and filter articles, replacing the traditional text-heavy approach. This vision is exemplified through a proof of concept centered on the ``reproductive number estimate of infectious diseases'' research theme, using a fine-tuned large language model (LLM) to automate the creation of structured records to populate a backend database that now goes beyond keywords. The result is a next-generation IR method accessible at https://orkg.org/usecases/r0-estimates.

* 6 pages, 1 figure

Via

Access Paper or Ask Questions

Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Jan 18, 2024

Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer

Figure 1 for Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Figure 2 for Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Figure 3 for Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Figure 4 for Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Abstract:In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generation capabilities of LLMs to produce structured scholarly contribution summaries, offering both a practical solution and insights into LLMs' emergent abilities. For LLMs, the prime focus is on improving their general intelligence as conversational agents. We argue that these models can also be applied effectively in information extraction (IE), specifically in complex IE tasks within terse domains like Science. This paradigm shift replaces the traditional modular, pipelined machine learning approach with a simpler objective expressed through instructions. Our results show that finetuned FLAN-T5 with 1000x fewer parameters than the state-of-the-art GPT-davinci is competitive for the task.

* 8 pages, 6 figures, Accepted as Findings of the ACL: EACL 2024

Via

Access Paper or Ask Questions