Abstract anzeigen
In the rapidly evolving field of academic research, efficient collaboration between different disciplines is essential for the progress of scientific innovation. Unified metadata facilitates and empowers this synergy, as it can substantially improve the practicality of research data across various scientific areas.
But in practice, metadata is often fragmented and comes in different formats. It is often scattered across multiple databases and systems like PyRAT, RSpace, Zotero, and other research platforms.
This makes the integration and analysis of the data a much more tedious process which leads to inefficiencies and research barriers.
This thesis will incorporate both imperative programming methods and AI-driven approaches, such as Large Language Models (LLMs) and Natural Language Processing (NLP) to automate and improve data management and evaluation. Furthermore, the capabilities of NFDI4Health—the National Research Data Infrastructure for Personal Health Data–will be leveraged to integrate and access streamlined metadata from diverse sources within the scientific research domain.
This thesis presents solutions with the help of the described systems and technologies and compares them in order to make research metadata more accessible to scientists, with the aim of optimizing the standardization of metrics from different formats through automation. Establishing an interoperable data infrastructure facilitates collaboration between researchers and lays the groundwork for larger projects.