TY - GEN
T1 - Digital Repositories and Linked Data
T2 - 1st Iberoamerican Knowledge Graphs and Semantic Web Conference, KGSWC 2019
AU - Gonzalez-Toral, Santiago
AU - Espinoza-Mejia, Mauricio
AU - Saquicela, Victor
N1 - Publisher Copyright: © 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
AB - Universities and libraries use digital repositories to store their bibliographic, scientific, and/or institutional content, and make the corresponding metadata publicly available on the web and through the OAI-PMH protocol. However, this metadata is not descriptive enough for a document to be easily discoverable. The emergence of Semantic Web technologies has led digital repository providers to publish and enrich their content using Linked Data (LD) technologies, but these institutions have used different generation approaches, in some cases ad hoc solutions for particular use cases, and none has compared the existing approaches to determine which is the best solution before applying it. To address this question, we performed a benchmark study that compares two commonly used generation approaches, and we describe our experience, lessons learned, and challenges found while publishing a DSpace digital repository as LD. Results show that the most straightforward method for extracting data from a digital repository is the standard OAI-PMH protocol: its execution time is much shorter than that of the database approach, and the additional data cleaning tasks are minimal.
KW - Digital repositories
KW - DSpace
KW - Linked Data
KW - OAI-PMH
UR - https://www.scopus.com/pages/publications/85066139306
U2 - 10.1007/978-3-030-21395-4_4
DO - 10.1007/978-3-030-21395-4_4
M3 - Conference contribution
AN - SCOPUS:85066139306
SN - 9783030213947
T3 - Communications in Computer and Information Science
SP - 41
EP - 55
BT - Knowledge Graphs and Semantic Web - 1st Iberoamerican Conference, KGSWC 2019, Proceedings
A2 - Hidalgo-Delgado, Yusniel
A2 - Villazón-Terrazas, Boris
PB - Springer Verlag
Y2 - 23 June 2019 through 30 June 2019
ER -